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(57) Abstract 

The invention relates to methods of detection of bacteria of the genus Shigella and/or Enteroinvasive E. coli (EIEC) by use 
of a set of nucleic acid probes. The invention further relates to a set of Shigella specific chromosomal sequences and fragments 
and to probes derived from the Shigella specific fragments. Additionally, probes were derived from a sequence from the Shigella 
ompA gene. In particular, a series of probes, each approximately 40 nucleotides in length, were designed having specificity for 
Shigella or for Shigella and Enteroinvasive E. coli, and having utility in nonisotopic test formats which require amplification to 
achieve high sensitivity. Specific hybridization probe sets which are capable of detecting substantially all clinically significant ser- 
otypes of Shigella, as well as enteroinvasive strains of E. coli, are disclosed. 
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NUCLEIC ACID PROBES FOR THE DETECTION OF SHIGELLA 

Description 

Background of the Invention 

The genus Shigella includes four species (major 

05 serogroups) : S. dysenteriae (Grp. A) , S. flexneri 
(Grp. B) , S. boydii (Grp. C) and S. sonnei (Grp. D) 
as classified in Bergey's Manual for Systematic 
Bacteriology (N.R. Krieg, ed. , pp. 423-427 (1984)). 
These serogroups are further subdivided into 

10 serotypes (Table 1) . The genera Shigella and 
Escherichia are phylogenetically closely related. 
Brenner and others have suggested that the two are 
more correctly considered sibling species based on 
DNA/DNA reassociation studies (D.J. Brenner, et ah , 

15 International J. Systematic Bacteriology , 23 : 1-7 

(1973)). These studies showed that Shigella species 
are on average 80-89% related to E. coli at the DNA 
level. Also, the degree of relatedness between 
Shigella species is on average 80-89%. Shigella 

-0 boydii serotype 13 is atypical in that it is only 
65% related to other Shigella serotypes and 
Escherichia . 

. - The genus Shigella is pathogenic in humans; it t 

^ causes dysentery at levels of infection of 1C to 100 

25 organisms. By contrast, the majority of E. coli 
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(9000 0:H serotypes) are not associated with 
diarrheal disease. Pathogenic E. coli serotypes are 
collectively referred to as Enterovirulent E. coli 
(EVEC) fJ.R. Lupski, et al . , J. Infectious Diseases , 

05 157:1120-1123 (1988); M.M. Levine, J. Infectious 

Diseases , 155 :377-389 (1987); M.A. Karmali, Clinical 
Microbiolocry Reviews , 2:15-38 (1989)). This group 
includes at least 5 subclasses of E. coli , each 
having a characteristic pathogenesis pathway 

10 resulting in diarrheal disease. The subclasses 
include Enterotoxigenic E. coli (ETEC) , 
Verotoxin-Producing E. coli (VTEC) , Enteropathogenic 
E. coli (EPEC) , Enteroadherent E. coli (EAEC) and 
Enter o invasive B« coli (EIEC) . The VTEC include 

15 Enterohemorrhagic E. coli (EHEC) since these produce 
verotoxins - 

The pathogenesis of En tero invasive E. coli is 
very similar to that of Shigella- In both, 
dysentery results from invasion of the colonic 

20 epithelial ceils followed by intracellular 
multiplication which leads to bloody, mucous 
discharge with scanty diarrhea. 

Thus, detection of Shigella and EIEC is 
important in various medical contexts. For example, 

25 the presence of either Shigella or EIEC in stool 
samples is indicative of gastroenteritis, and the 
ability to screen for their presence is useful in 
treating and controlling that disease. Detection of 
Shigella or EIEC in any possible transmission 

30 vehicle such as food is also important to avoid 
spread of gastroenteritis. 
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Currently, presence of Shigella in stool 
samples is detected by cultivating an appropriately 
prepared sample on microbiological media under 
conditions favorable for growth of those bacteria. 
The resulting colonies are then examined for 
microbiological and biochemical characteristics, a 
process that typically takes at least three days and 
does not permit processing large numbers of samples. 
However, hospitals do not test for the presence of 
!0 EIEC in stool because of che difficulty of 

serotyping which is necessary to identify the EIEC 
among the numerous, non-pathogenic E. coli normally 
present in stool. 

Summary of the Invention 

15 The present invention relates to methods of 

detection of bacteria of the genus Shigella and/or 
Enteroinvasive E. coli (EIEC) by use of a set of 
nucleic acid probes. The invention further relates 
to a set of Shigella specific chromosomal sequences 

20 and fragments, which were isolated by subtractive 
hybridization against non-Shigella DNA, and to 
probes derived from the Shigella specific fragments. 
Additionally, probes were derived from a sequence 
from the Shigella ompA gene. In particular, a 

25 series of probes, each approximately 40 nucleotides 
in length, were designed having specificity for 
Shigella or for Shigella and Enteroinvasive E. coli , 
and having utility in non-isotopic test formats 
which require amplification to achieve high 

30 sensitivity. In addition, specific hybridization 
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probe sets were developed which are capable of 
detecting substantially all clinically significant 
serotypes of Shigella, including S. sonnei , S. 
f lexneri , S. boydii , and S. dysenteriae , as well as 

05 entero invasive strains of E. coli . 

Probes or probe sets of the present invention 
can be used in a number of hybridization formats for 
the detection of Shigella species and/or 
Entero invasive E. coli . For example, the presence 

10 of one or more Shigella species and/or one or more 
species of EIEC in a sample can be determined by 
lysing the cells in the sample, contacting the 
sample with a DNA probe set under conditions 
suitable for hybridization of the probes with 

15 shigella and/or EIEC DNA, capturing the hybrids 
formed between the probes and the sample DNA, and 
detecting the hybrid complexes by a suitable method 
as an indication of the presence of Shigella or EIEC 
in the sample. 

20 Brief Description of the Figures 

Figure 1 is a flow chart illustrating a 
strategy for the isolation of Shigella specific DNA 
sequences from "target" DNA and of Shigella specific 
fragments from a library of target DNA clones. 

25 Figure 2 illustrates the nucleotide sequence 

(SEQ ID NO:l) of Shigella specific fragment NT 6 and 
some flanking sequence, and the locations and 
sequences of probes 1500 (SEQ ID NO:14), 1501 (SEQ 
ID N0:15) and 1911 (SEQ ID NO:16), which are derived 

30 from these sequences. 
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Figure 3 illustrates the nucleotide sequence of 
Shigella specific fragment NT11-2 (SEQ ID NO:2), and 
the locations and sequences of probes 1682 (SEQ ID 
NO: 17), 1683 (SEQ ID NO: 18), 1708 (SEQ ID NO: 19), 

05 and 1709 (SEQ ID NO:20), derived from the fragment. 

Figure 4 illustrates the nucleotide sequence of 
Shigella specific fragments NT14 (SEQ ID NO: 4) and 
NT15 (SEQ ID NO:3), comprising a version of the 
Class III repeat isolated from S. sonnei , and the 

10 location and sequence of probes 1864 (SEQ ID NO: 22) 
and 437 (SEQ ID NO:21), derived from these 
fragments. The sequences of three versions of the 
Class III repeat isolated from E. coli (E.c. 1; SEQ 
ID NO: 5; and E.c. 2; SEQ ID NO: 6 and SEQ ID NO: 7) 

15 and S. flexneri (S.f.; SEQ ID NO:8 and SEQ ID NO: 9) 
are also shown. 

Figure 5 illustrates the sequence of Shigella 
specific fragment NT18-la (SEQ ID NO: 10), and the 
location and sequence of probes 1712 (SEQ ID NO: 23) 

20 and 1713 (SEQ ID NO: 24), derived from the fragment. 
Figure 6 illustrates the sequence of Shigella 
specific fragment NT19-2 (SEQ ID NO: 11) , and the 
location and sequence of probes 1684 (SEQ ID NO: 
25) and 1685 (SEQ ID NO: 26), derived from the 

25 fragment. 

Figure 7 illustrates a portion of the sequence 
of the S. dysent: ?riae ^S.d.) ompA gene (SEQ ID NO: 
12) , and the location and sequence of probes 1706 
(SEQ ID NO: 27) and 1707 (SEQ ID NO: 28), derived 
30 from the ompA gene sequence. The E. coli (E.c.) 
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ompA gene sequence (SEQ ID NO: 13) corresponding to 
the same region is shown for comparison. 

Detailed Description of the Invention 

Previous investigators interested in developing 

05 specific probes for Shigella have targeted the 

virulence plasmid. Both Shigella and EIEC harbor a 
single copy virulence plasmid approximately 14 0 MD 
(215 kilobasepairs) in size which is necessary for 
invasion (T.L. Hale, Infection and Immunity , 

10 40:340-350 (1983)). For example, U.S. Patent No. 
4,816,389 (Sansonetti et al .) discloses a 27 
kilobasepair (kb) region of the virulence plasmid 
proven necessary for invasion. These Investigators 
have shown that the virulence plasmid, which exists 

15 as one copy per bacterium, is unstable; regions of 
the plasmid may be deleted. Shigella and EIEC 
strains, which have been stored or passaged in the 
laboratory, frequently are found to contain 
virulence plasmids of reduced size (C. Sasakawa, et 

20 al. , Infection and Immunity , 51:470-475 (1986); A.T. 
Maurelli, et al . , Infection and Immunity , 
42:397-401)). The 27 kb region from which the 
probes of Sansonetti et al . have been derived has 
been shown to be one of the unstable segments (P.K. 

25 Wood, J. Clinical Microbiology , 24:498-500 (1986)). 
Strains of Shigella or EIEC which do not contain a 
portion of the 27 kb target region are not detected 
by the probe and are incorrectly identified as 
non-Shigella or non-EIEC. 
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Moreover, the 27 kb probe region contains 
insertion element 1 (IS1) which is ubiquitous among 
the Enterobacteriaceae (M. Venkatesan, et al . , J\ 
Clinical Microbiology , 2j6:261-266 (1988)). It also 
05 contains at least one copy of insertion element 600 
(IS600; S. Matsutani, et al . , J. Molecular Biology , 
196:445-455 (1987)), which occurs frequently in both 
pathogenic and non-pathogenic representatives of E. 
coli, as well as in Shigella and EIEC (unpublished 
10 result) . The presence of these broadly distributed 
insertion elements and the large size of the probes 
designed from the 27 kb region decrease the utility 
of these probes in non-isotopic test formats which 
require amplification in order to achieve high 
15 sensitivity. 

In contrast, the present invention relates to 
probes and probe sets which are (1) developed from 
Shigella specific fragments derived from chromosomal 
sequences of Shigella and (2) moderate in size, each 
20 probe being approximately 40 bases in length. The 
increased stability of the chromosomal sequences 
detected by the probes compared to sequences of the 
virulence plasmid can result in increased 
reliability of detection. Furthermore, moderately 
25 sized probes have utility in non-isotopic test 

formats which require amplification to achieve high 
sensitivity. Both the Shigella specific fragments 
and the probes derived from them are also useful as 
hybridization probes in other formats. Some of the 
30 fragments derived from the Shigella chromosome are 



BNSDOCID:<WO 9303167A2> 



WO 93/03187 _ ^ PCT/US92/06617 



-8- 



also present in an episomal location on the invasion 
plasmid or another plasmid. 

In one embodiment, the invention features a 
nucleic acid probe set consisting essentially of 
05 nucleic acids with sequences that are: 

a. derived from the chromosomal sequence of 
representative bacteria of the species Shigella 
sonnei (ATCC 29930, designated type strain) and 
Shigella flexneri type 2a (ATCC 29903, 

10 designated type strain) but are less than the 

entire chromosomal sequence of these bacteria; 

b. capable of hybridizing to DNA of members of the 
four known Shigella species and to 
Enteroinvasive E. coli (EIEC) ; 

15 c, not capable of hybridizing or only weakly 
hybridizing to DNA of bacteria that are in 
neither the genus Shigella nor the group EIEC. 

As used herein, a sequence fragment or 
oligonucleotide that is "derived from a chromosomal 

20 sequence" is a natural, engineered or synthetic 
molecule having a sequence which is identical or 
complementary to a . chromosomal sequence or is 
identical or complementary to a variant of the 
chromosomal sequence. A sequence fragment or 

25 oligonucleotide which is identical or complementary 
to a variant of a selected chromosomal sequence 
(i.e., the variant differs in sequence from the 
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chromosomal sequence) is a homologue of the sequence 
or oligonucleotide which is identical or 
complementary, respectively, to the selected 
chromosomal sequence. This type of homologue will 
05 have a nucleotide sequence substantially similar to 
a chromosomal sequence and will retain the desired 
function (will be able to hybridize to substantially 
the same nucleic acids as the sequence fragment or 
oligonucleotide which is identical or complementary 
i0 to that chromosomal sequence under similar 
hybridization conditions) of the fragment or 
oligonucleotide which is identical to the selected 
chromosomal sequence , A homologue may differ from 
the chromosomal sequence in sequence and/or may 
15 contain modified nucleotides or nucleotide analogs 
(e.g., phosphorothioates, methylphosphonates) . A 
homologue must be able to hybridize to the same 
nucleic acid as the sequence fragment or 
oligonucleotide which is identical or complementary 
20 to that chromosomal sequence under similar 

hybridization conditions. It is well known to those 
skilled in the art that either strand of a 
double-stranded DNA sequence can serve as the target 
for a complementary probe. The complement of a 
25 given probe is expected to have a substantially 
similar hybridization pattern, under similar 
hybridization conditions. The probes may be DNA or 
RNA or modified DNA or RNA. 

Nucleic acid fragments or oligonucleotides 
30 containing sequences derived from a chromosomal 

sequence, their homologues, and complements of all 
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of the foregoing may be used as probes. For 
example, the Shigella specific fragments, portions 
thereof and oligonucleotides from the Shigella 
specific fragments may be used as hybridization 
05 probes. 

In one embodiment, either of two probe sets of 

synthetically produced nucleic acid probes (each 

approximately 4 0 nucleotides long) will detect 

substantially all clinically significant serotypes 
10 of Shigella, including S. sonnei , S. f lexneri , S_. 

boydii and S. dysenteriae . Clinically significant 

serotypes or isolates are those which are human 

pathogens. In addition, these probe sets recognize 

some or all enteroinvasive strains of E. coli , 
15 exclusive of other enteric bacteria tested, with the 

exception of Escherichia ferqusonii . 

The invention further relates to methods of 

detecting Shigella species or EIEC in a sample. For 

example, one or more Shigella serotypes and/ or EIEC 
20 present in a sample can be detected by lysing the 

cells in the sample; contacting the sample with a 

nucleic acid probe or probes of the present 

invention under conditions that allow the probes to 

hybridize to Shigella and EIEC DNA in the sample, 
25 thus forming hybrid nucleic acid complexes; 

isolating hybrid nucleic acid complexes formed 

between the probes and DNA in the sample, and 

detecting the hybrid nucleic acid complexes as an 

indication of Shigella or EIEC in the sample. For \ 
30 example, clinical (e.g., stool), environmental (e.g. 
water) , or food specimens may be subjected to such a 
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procedure to ascertain the presence of Shigella 
and/or EIEC species in the sample. 

In preferred embodiments of the method, one or 
more pairs of probes are selected as a capture and 

05 detector probe pair for use in a dual probe liquid 
hybridization format. These probes can be produced 
synthetically by chemical or enzymatic synthesis 
methods. They may be produced as part of a larger 
molecule. For example, the capture probe can be 

!0 tailed with 150-200 deoxyadenosine triphosphate 
(dATP) residues using the enzyme terminal 
deoxynucleotidyl transferase. The detector probe 
can be incorporated into an amplification/detection 
system, such as a biotin-streptavidin-alkaline 

!5 phosphatase system. Both the capture and detector 
probes are then allowed to hybridize to the target 
nucleic acid in a background of competitor nucleic 
acid from the sample. The hybrid products can be 
captured out of the mixture by magnetic beads 

20 complexed with tails of deoxythymidine monophosphate 
generally 14 residues longer. The captured hybrids 
(i.e., those affixed to or captured on the magnetic 
beads) are detected by means of the 
amplification/detection system selected. 

Isolation of Shigella-Specif ic Fragments 

Shigella-specif ic genomic sequences were 
isolated by subtractive hybridization using 
\ biotin-streptavidin agarose affinity column 

chromatography methods described by Langer et al . 
30 (Langer, P.R. et al . , Proc. Natl. Acad. Sci. USA , 
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78:6633-6637 (1981) and Welcher et al . (Welcher, 
A. A. et al ., Nucleic Acids Res. , 14: 10027-10044 
(1986)), each of which is incorporated herein by 
reference. The method is outlined in Figure 1. 

05 For example, a mixture of competing DNAs, such 

as the complex DNA competitor mix described in 
Figure 1 can be used. In general, a complex DNA 
competitor mix will contain an excess of DNA 
(relative to the target DNA) from one or more 

!0 Enterohemorrhagic E. coli isolates, one or more 
Enterotoxigenic E. coli isolates, and one or more 
non-pathogenic E. coli isolates. The mix may 
further contain specific sequences already recovered 
or sequences designed to eliminate non-specific 

15 sequences. For example a plasmid containing a 17 kb 
region of the Shigella virulence plasmid such as 
pHS4033, Class III repeat DNA, or M13mpl8 RF DNA may 
be included. 

The particular complex DNA competitor mix used 

20 contained a 6-fold excess by weight of DNA (relative 
to the target DNA) from each of Enterohemorrhagic E^ 
coli isolate IG 3040, Enterotoxigenic E. coli 
isolate IG 3060, and non-pathogenic E. coli (YMC) . 
The mix further contained DNA, in amounts equal to 

25 the target DNA by weight, from each of the 

following: a pBR322 clone containing a 17 kb region 
of the Shigella virulence plasmid (plasmid pHS403 3, 
Boileau, C.R. et al ., J. Clin. Microbiol . 20(5) : 
959-961 (1984)), M13mpl8 RF DNA, and the sequence of 

30 the Class III repeat (Class IIIR-IG900) cloned into 
pBR322. (The 1.3 kb Class III repeat and adjacent 
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chromosomal DNA was cloned into pBR322. The length 

of the insert was 3.5 kb.) The complete mixture was 

labelled with biotin-ll-dUTP by the nick-translation 

method. The "target DNA" from which specific 

05 sequences were identified was isolated from a single 

Shigella species, digested with restriction 

32 

endonuclease Sau3A, and end-labelled with P in an 
end-filling reaction with DNA polymerase I. 

The two DNA pools were combined such that the 

10 competing DNAs were at a 20-40 fold molar excess 
relative to the Shigella DNA. The mixture was 
denatured and hybridized in liquid at low stringency 
overnight. The hybridization buffer contained 0.75 
M NaCl, 50 mM NaPC> 4 , 1 mM EDTA, 0.05% SDS and 4 0% 

15 formamide. The hybridization mixture then was 
passed over the streptavidin agarose column. 
Shigella DNA sequences that were sufficiently 
complementary to the competing DNAs to form hybrids 
under the conditions used were retained on the 

20 streptavidin agarose affinity column by virtue of 
the biotin incorporated into the competitor DNA. In 
contrast, sequences that were unable to form hybrids 
under those conditions were enriched in the nucleic 
acid fraction passing through the column. The 

25 latter sequences contain Shigella-specif ic 
sequences. 

A small aliquot of the DNA enriched fcr 
Shigella-specif ic sequences (previously 

3 2 

P-labelled) was used to probe Shigella and £. coli 
30 DNAs of interest (0.1 ^g of each DNA was spotted in 
a 3 4 volume on nitrocellulose) . Hybridization 



BNSDOCt0:<WO 93031 87A2> 



WO 93/03187 _ ^ PCTAJS92/06617 



-14- 



conditions were those described below in Example II 
for nick-translated fragments. Cross-hybridization 
to the competitor E. coli provided an indication 
that further enrichment of the Shigella "target" DNA 

05 was required. Typically, four cycles of competition 
hybridization versus the biotinylated E. coli 
competitor DNA were necessary to eliminate the 
cross-hybridization. This was accomplished by 
repeating the competition hybridization/af f inity 

10 capture cycle, using the Shigella DNA which passed 
through the column during the previous cycle, as the 
starting material for the next cycle. In this way, 
the labeled Shigella target DNA was progressively 
enriched for Shigella-specif ic sequences in each 

15 cycle. 

The nucleic acid which was enriched for 
Shigella-specif ic sequences was then used to probe a 
Sau3A library of the same Shigella isolate used as 
the "target DNA" in the subtractive hybridization; 

20 the library was constructed in the plasmid vector, 
pUCIS- Inserts (fragments) from positive clones 
were purified from the vector, labelled with 32 P by 
nick-translation and used to probe mini-cyto-dot 
panels of inclusivity and exclusivity organisms as 

25 described in Examples I and II. A probe shows 
inclusivity toward an organism if DNA from the 
organism hybridizes to the probe, and shows 
exclusivity toward an organism if the probe does not 
hybridize (or if hybridization is barely detectable) 

30 to DNA from that organism under the particular 
hybridization conditions used. 
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In most cases, hybridization to inclusivity 
organisms was much stronger than hybridization to 
competitors or exclusivity organisms. When 
necessary, subcloning was done to remove the 

05 sequences which cross-hybridized to competitors. At 
this point, the fragments were labelled with P by 
nick-translation and used to probe full inclusivity 
and exclusivity cyto-dot panels. 

The inclusivity panels consisted of bacteria 

1° representing all known Shigella serotypes as well as 
Enteroinvasive E. coli which exhibit the same 
pathogenesis as Shigella. These organisms are 
listed in Tables 2 through 6. The exclusivity 
panels consisted of non-pathogenic E. coli , 

15 Enterotoxigenic E. coli , Enteropathogenic E. coli , 
Enterohemorrhagic E. coli , other Escherichia species 
and gram negative Enterobacteriaceae. The 
exclusivity organisms are listed in Tables 6 and 
7a/7b. 

20 In the Tables, under the conditions used, (-) 

indicates' no signal, (+/-) or (-/ + ) was barely 
detectable, (+) indicates a weak but reproducible 
and readily detectable signal, (++) indicates a 
moderate signal, (+++) indicates a strong signal, 

2j and (++++) indicates a signal comparable to the 
positive control (DNA from the organism from which 
the fragment was isolated, or a known sequence 
identical to the probe being tested) . The genomic 
(chromosomal) DNA fragments identified by the 

30 subtractive hybridization and refinement protocols 
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described above are referred to as Shigella specific 
fragments . 

1. Shigella specific fragments NT19-2 (SEQ ID NO: 
11) and NTlS-la (SEQ ID NO: 10) were isolated 

05 from a library of Shigella flexneri (ATCC 

29903) genomic clones by probing the library 
with Shigella specific sequences isolated from 
S. flexneri (ATCC 22903) 32 P-labeled "target" 
DNA, using the "complex DNA competitor mix 11 

10 described above in the subtractive 

hybridization steps. 

2. Shigella specific fragments NT 6 (see SEQ ID 
NO:l), NT11-2 (SEQ ID NO:2), NT14 (SEQ ID 
NO: 4) and NT15 (SEQ ID NO: 3) were isolated 

15 from a library of Shigella sonnei (ATCC 29930) 

genomic clones by probing the library with 
Shigella specific sequences isolated from S. 
sonnei (ATCC 29930) 32 P-labeled "target" DNA 
and using a single non-pathogenic E. coli ( YMC) 

20 and 1 tig of pBR322 vector DNA as source of 

competitor DNA in the subtractive 
hybridization- In this case, avidin agarose 
rather than streptavidin agarose was used in 
the affinity column. 

25 The hybridization results against inclusivity and 
exclusivity organisms for these Shigella-specif ic 
fragments are recorded in Tables 2-7. In addition, 
a set of ompA probes were developed from published 
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data (G . Braun, et al . , Nucleic Acids Research , 
10:2367-2378 (1982)) by comparison of the outer 
membrane protein gene sequence (ompA) of Shigella 
dysenteriae (SEQ ID NO: 12) with that of E. coli (SEQ 
°5 id NO: 13). The regions of ompA gene sequence which 
appeared most different between the two sequences 
were selected for the development of test probes. 
These were synthesized and assayed by hybridization 
analysis as described above. 

10 Isolation of Shiqella-Specif ic Oligonucleotides 

The Shigella-specif ic fragments described above 
displayed the most marked specificity for 
inclusivity organisms of the Shigella DNA /Vagments 
analyzed. These fragments were sequenced anu 

15 oligonucleotide probes useful as capture and 

detector probes were designed from these sequences. 
Oligonucleotide capture and detector probes were 
also designed from a fragment of the ompA sequence 
by comparative sequence analysis. Following 

20 synthesis, the oligonucleotides were end-labelled 

32 . . 

with P and tested by cyto-dot hybridization 

analysis as described in Example I to ensure that 

they exhibited the desired inclusivity and 

exclusivity behavior or pattern. In several cases, 

25 an additional exclusivity panel was tested at this 
point (Table 8) . This panel consisted of 4 ^g DNA 
dots of gram positive and gram negative 
bacteria — including aerobic and anaerobic 
representatives commonly found in stool. The DNA 

30 was isolated by a protocol which makes use of glass 
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beads to physically disrupt the cell wall of 
bacteria. Each DNA was spotted on nitro-cellulose 
filters in a 3 m1 volume; the DNAs were denatured, 
neutralized and fixed as described in Example I for 
05 the preparation of cyto-dot panels. 

Description of Probes and Hybridization Behavior 
With Respect to Indus ivity and Exclusivity 

Oligonucleotide probes were derived from the 
Shigella-specif ic fragments identified by affinity 

10 chromatography and from the ompA sequence. The 
sequence of each oligonucleotide probe and the 
Shigella-specif ic DNA fragment that each was derived 
from are listed in Table 9, The inclusivity and 
exclusivity hybridization behavior of the clones and 

15 subclones, and the oligonucleotides designed from 
these sequences are described below. 

Fragment NTS and Probes 1500, 1501, and 1911 
NT6 (see SEQ ID NO:l) is a 124 bp Sau3A 
Shigella specific fragment. The sequence is 

20 repeated 6 times in Shigella sonnei (ATCC 2993 0) 
chromosomal DNA. It is also found in one or two 
copies on the virulence plasmid of other Shigella 
isolates. The entire fragment was sequenced (Figure 
2) . The first 124 bp of Figure 2 are from NT 6 (see 

25 SEQ ID N0:1). The Sau3A site at the 3 '-end of the 
NT-6 sequence is indicated. In this and other 
Figures, IUPAC conventions for referring to 
nucleotides and sequence ambiguities are used: 
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Unambiguous bases: A, C, G, and T (or U) 

Two possible bases: M (A or C) 

R (A or G) 

W (A or T) 

05 S (C or G) 

Y (C or T) 
K (G or T) 

Three possible bases: B (C, G or T) 

D (A, G or T) 

10 H (A, C or T) 

V (A, C or G) 

Four possible bases: N (A, C, G or T) 



Where there is a weak band on the sequencing gel, 
but no other band, the base is indicated by a lower 

15 case letter. Regions of ambiguous nucleotide order 
due to band compression are enclosed in parentheses. 

Two oligonucleotides derived from NT6, each 35 
bases long (1500, SEQ ID NO: 14) and 1501, SEQ ID NO: 
15) , were designed and can be used as capture and 

20 detecti on probes (Table 9) • A third probe, 4 0 bases 
long (1911, SEQ ID N0:16), was designed from NT 6 and 
from additional sequence adjacent to the Sau3A NT-6 
fragment (38 bp; see SEQ ID NO:l). This additional 
sequence was obtained from a clone isolated from an 

25 S. sonnei library using the NT-6 124 bp fragment as 
a probe. A sequencing primer internal to NT-6 was 
used for sequencing. Oligonucleotide 1911 was 
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designed and tested. The hybridization pattern of 
1911 was identical to that of NT6, suggesting it is 
part of the repeated element. 

The individual oligonucleotides were tested for 

05 hybridization to inclusivity and exclusivity 
organisms. In cases where all three probes 
displayed the same hybridization pattern, the 
results are not listed separately. The individual 
oligonucleotides, like the parent fragment, 

10 hybridized (using a + signal as the lower limit of 
hybridization) to all S. sonnei isolates tested, 
many S. dysenteriae and S. boydii , S. flexneri 
serotypes, including S, flexneri type 6 (Tables 2 
through 5 and summary Table 10) . The 

15 oligonucleotide probes derived from NT6 hybridized 
to all Enteroinvasive E, coli , but not to other 
classes of pathogenic E. coli , non-pathogenic E. 
coli or other organisms commonly found in stool 
(Tables 6-8) . 

20 Fragment NT11-2 and Probes 1682, 1683, 1708 and 1709 
NT11-2 (SEQ ID NO: 2) is a 796 bp Hha I subclone 
of an original Sau3A fragment (NTH) which was 3.5 
kb in length. Fragment NT11-2 has been sequenced 
(Figure 3; SEQ ID NO: 2). Two sets of 

25 oligonucleotide probes, useful as a capture/ detector 
probe pair were designed from opposite ends of the 
fragment. Probes 1682 (SEQ ID NO: 17) and 1683 (SEQ 
ID NO:18) are 41 nucleotides long (Table 9). Probes 
1708 (SEQ ID NO: 19) and 1709 (SEQ ID NO:20) are 35 

30 and 3 6 nucleotides long, respectively. When used as 
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capture/ detector probes, each capture probe can be 
synthesized with an additional three nucleotides 
(TAT) at the 3' end to ensure efficient tailing by 
the enzyme terminal deoxynucleotidyl transferase* 

05 Tables 2 through 5 and summary Table 10 show that 
the two sets of oligonucleotides have different 
hybridization patterns. The cumulative 
hybridization pattern using the four probes together 
is the same as that of the parental fragment, NT 

10 11-2, which hybridizes to all S. sonnei isolates, 
and to some S. dysenteriae , S. flexneri and S. 
boydii isolates- However, when used as 
capture/detector probe pairs (i.e., 1682 paired with 
1683 or 1708 paired with 1709) , certain serotypes 

15 would not be detected because one partner of each 
probe set does not hybridize to the isolate (e.g., 
S. dysenteriae types 9 and 10, S. boydii type 17). 
One of the four Enteroinvasive E. coli is detected 
strongly with 1708/1709 and weakly with 1682/1683. 

20 Two pathogenic E. coli isolates would be detected by 
1708/1709 while only one of these isolates would be 
detected with 1682/1683. Apart from these 
pathogenic isolates, the oligonucleotides do not 
cross-hybridize to other non-pathogenic E. coli or 

25 other organisms commonly found in stool (Tables 6 
and 7a) . 

Fragments NT14 and NT15, and Probes 437 and 1864 

NT14 (SEQ ID NO: 4) and NT15 (SEQ ID NO : 3 ) are 
Sau3A fragments which are approximately 800 bp and 
30 600 bp in size, respectively. The two fragments 
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have been sequenced* and are portions of a highly 
repeated element which is 1.3 kb in length. One 
representative repeat each from S. sonnei (ATCC 
29930) and S. flexneri (ATCC 29903) and two repeats 
05 from E. coli (IG900) were cloned and sequenced 

(Figure 4; S, sonnei repeat, SEQ ID NO: 3 and SEQ ID 
NO: 4; E. coli repeat 1, SEQ ID NO: 5; E. coli repeat 
2, SEQ ID NO: 6 and SEQ ID NO: 7; S. flexneri repeat, 
SEQ ID NO: 8 and SEQ ID NO: 9). The repeat is highly 
10 conserved and has characteristics of a transposable 
element. Over 2 0 copies of the repeat sequence are 
present in the chromosome and virulence plasmid of 
Shigella. The repeat occurs in 1 to 3 copies in 
some B. coli competitors, but not in other bacterial 
15 species. 

There are only a few differences between the E. 
coli and S. sonnei sequences shown in Figure 4 . A 
17 base oligonucleotide probe (probe 1864, SEQ ID 
NO: 22) was designed such that a single mismatch is 

20 located eight bases from either end of the probe 
(Table 9) . This probe hybridizes strongly to the 
majority of Shigella and EIEC tested and does not 
cross-hybridize to competitor bacteria (Tables 2-8 
and summary Table 10) . A companion detector probe 

25 can be designed within the boundaries of the Class 
III repeat on either side of the specific capture 
probe, 18 64. One such example is the complement of 
probe 437 (SEQ ID NO: 21) (49 bases) for which 
inclusivity and exclusivity is listed in Tables 2 

3° through 8. This probe hybridizes strongly to all 
serotypes of Shigella except S. boydii serotype 13. 
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Probe 4 37 hybridizes weakly to a number of 
non-pathogenic and pathogenic strains of E. coli , 
but does not hybridize to other Enterobacteriaceae 
tested. The hybridization signal with certain E^ 

05 coli is due to the low copy number of the Class III 
repeat in this genus versus the high copy number of 
the repeat in the genus Shigella. 

Two extra bases incorporated into the sequence 
of probe 437 as a result of sequencing errors in the 

10 S. sonnei sequence (a G and a T) have proved useful 
in decreasing the signal for E. coli isolates 
relative to Shigella strains. The 437 probe does 
not hybridize as well as expected to the positive 
control for S. sonnei or S. f lexneri , suggesting 

15 that the two nucleotides in question are not present 
in the S. sonnei sequence. This observation also is 
likely to be related to copy number differences 
between the two genera. 

Probe 1864 (SEQ ID NO: 22) hybridized to all 

20 isolates of S. dysenteriae tested except for one 
isolate of serotype 1 (IG 826). However, serology 
on this isolate was not confirmed. 

Fragment NT18-la and Probes 1712 and 1713 

NT 18-la (SEQ ID NO: 10) was subcloned from the 

25 original Sau3A fragment, NT18, in two steps. A 
Pstl/SacI double digest of NT18 (1500 bp) yielded 
fragment NT18-1 (1250 bp), which was then restricted 
with Haelll to generate NT18-la (630 bp) . Sequences 
related to fragment NT18 are also known to occur on 

30 a small multicopy plasmid which is distinct from the 
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215 kbp invasion plasmid. The sequence of NT18-la 
is shown in Figure 5 (SEQ ID NO: 10). 
Oligonucleotide probes 1712 (SEQ ID NO: 23) and 1713 
(SEQ ID NO:24), suitable as capture/detection 
05 probes, both 37 bases long, were designed from the 
sequence (Figure 5 and Table 9) . The hybridization 
pattern (isolates detected) of the oligonucleotide 
probes was identical to the parental Shigella 
specific fragment NT18-la with the exception of one 
10 S. flexneri isolate (Tables 2-5 , 7b). This strain 
(IG 711) was detected by oligonucleotide probe 1713, 
but not by probe 1712. In a liquid hybridization 
assay uh'ifi the two probes would be used as a 
capture /detection probe pair this organism would not 
15 be detected. Under the conditions used, the probes 
hybridize to 6/8 type 1 S. dysenteriae , to all S. 
flexneri isolates with the exceptions of three type 
6 isolates, the IG711 isolate mentioned above, 
IG872, IG741, and IG709 (Tables 2 through 5 and 
20 summary Table 10). The probes do not detect 

Enter o invasive E* coli , but cross-hybridize to one 
pathogenic E. coli under the conditions used. They 
do not cross-hybridize to non-pathogenic E. coli or 
other bacteria commonly found in stool (Tables 6 
2 5 through 8 ) . 

Fragment NT19-2 and Probes 1684 and 1685 

Fragment NT19-2 (388 bp; SEQ ID NO: 11) is an 
Rsal subclone of the original Sau3A fragment which 
was 1070 bp in length. NT19-2 was sequenced (Figure 
30 6; SEQ ID NO: 11) and oligonucleotide probes 1684 
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(SEQ ID NO:25) and 1685 (SEQ ID NO:26), each 35 
nucleotides long, were designed (Table 9) . These 
probes are suitable as capture/detection probes. 
The hybridization patterns or spectrum of isolates 

05 detected by the individual oligonucleotides and the 
parental fragment are identical. Hybridization to 
some S. boydii , some S. sonnei and all S. flexneri 
except type 6 was observed (Tables 2-5, summary 
Table 10) . The probes hybridize to one out of five 

10 Enteroinvasive E. coli , and do not cross-hybridize 
to other pathogenic E. coli , non-pathogenic E. coli 
or other bacteria commonly found in stool (Tables 
6-8) . 

ompA Fragment and probes 1706 and 1707 

15 Oligonucleotides 1706 (SEQ ID NO: 27) and 1707 

(SEQ ID NO: 28) were designed from the published 
sequence of the outer membrane protein gene (ompA) 
of Shigella dysenteriae. Figure 7 shows the S. 
dysentaeriae ompA gene sequence from nucleotide 

20 position 893 through 1076, according to the 

numbering of Braun et al . ( Nucl. Acids Res . 10 (7) : 
2367-2378 (1982); SEQ ID N0:12). This region 
contains significant differences between the E. coli 
and S. dysenteriae ompA coding sequences. The 

25 sequence of the corresponding region of the E. coli 
ompA gene is shown for comparison (SEQ ID NO:13), 
and the positions of probes 1706 (SEQ ID NO: 27) and 
1707 (SEQ ID NO: 28) are indicated in Figure 7. 

Both oligonucleotides are 35 bases long (Table 

30 9) . Probe 1706 has 7 differences between the E. 
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coli and the S. dysenteriae sequence. The region 
from which probe 1707 was designed is 15 bases 
shorter in E. coli . Additionally, there are 
numerous differences in that probe site between E. 

05 coli and S. dysenteriae . These probes hybridize 
with S. dysenteriae types 1 and 2 and many S . boydii 
serotypes (Tables 2-5 , summary Table 10), When 
probe 17 07 is used as the specific capture probe and 
1706 is used as the detector probe in liquid 

10 hybridization, no hybridization is anticipated to 
Enteroinvasive E. coli , other pathogenic E. coli , 
non-pathogenic E. coli and other bacteria commonly 
found in stool with the exception of Escherichia 
fergusonii (Tables 6-8) . 

15 Description of Probe Sets 

Probe Set I 

A desired inclusivity/exclusivity pattern may 
be achieved by use of various combinations of 

20 probes. One possible strategy involves pooling one 
or more combinations of probe pairs to make a probe 
set. For example, capture/detection 
oligonucleotides designed from fragments NT 6 (SEQ 
ID NO:l), NT 19-2 (SEQ ID NO: 11) and the ompA gene 

25 (SEQ ID NO: 12) may be used together as a probe set 
or combination for detection of substantially all 
clinically significant Shigella serotypes. One 
possible probe set comprises three capture/detection 
probe pairs, including probe pairs 1684/1685, 

30 1707/1706, and a pair selected from probes 
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1500/1911/1501. This substantially inclusive probe 
set detects all Shigella serotypes except for S. 
dysenteriae type 10 and S. boydii type 13 (using a + 
signal as a lower limit for hybridization) . In 
05 addition, this probe set does not cross-hybridize 
under the conditions used to any of the competitors 
tested except for Escherichia f erqusonii . 

The two Shigella serotypes that are not 
detected by probe set I under these conditions, are 

10 rarely isolated in the United States, as indicated 
by records of the Center of Disease Control. For 
example, out of a total of 167,915 cases of Shigella 
infection reported by the Center for Disease Control 
for the years 1976 through 1987, only two cases were 

15 identified as S. dysenteriae type 10 and three cases 
as S. boydii type 13. 

In a capture/detection assay format, the more 
specific oligonucleotide of a capture/detector pair 
is preferred as the capture probe. In the case of 

20 the 1684/1685 probe set, either oligonucleotide may 
serve as the capture probe with equivalent 
hybridization results. However, in the case of the 
1707/1706 probe set, oligonucleotide 1707 is 
preferred as the capture probe since it does not 

25 cross-hybridize to competitors. (Probe 1706 has 
(+/-) hybridization signal with certain competitors 
and a strong signal with an E. blattae isolate 
(Tables 6, 7b) ) . 

In the case of the 1500/1911/1501 

30 oligonucleotide probes, it is best to use either 

1500 or 1911 as the capture probe. Probe 1501 has a 
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(+/-) hybridization signal with certain competitors 
(Table 8) , but may be used as a detector probe with 
no adverse effects. 

Probe Set II 

05 An alternative probe set combines 

capture/detection oligonucleotide probe pairs from 
the Class III repeat (fragments 14 and 15; SEQ ID 
NO: 4 and SEQ ID NO: 3, respectively) and the ompA 
gene (SEQ ID NO: 12). This substantially inclusive 

10 probe set 1707/1706 and 1864/437-complement) 

hybridizes to all Shigella except S. boydii type 13 
and cross-hybridizes weakly to Escherichia 
f erqusonii * 

In the case of the capture/ detector pair 

!5 1864/437-complement, it is best to use 

oligonucleotide 1864 as the capture probe since it 
is the more specific probe of the pair. 
Oligonucleotide probes designed from sequences to 
the left or right (in the 5' or 3 ' direction and 

20 from the same strand) of the sequences from which 
probe 1864 was derived may also serve as detector 
probes. The complement of probe 437 is one such 
example, and is expected to substantially retain the 
hybridization pattern of probe 437. 

25 Use of probe combination II requires only four 

oligonucleotides instead of six, yet gives the 
desired inclusivity and exclusivity (substantially 
inclusive, in this case) . In addition, the target 
of probe 18 64 is present in multiple copies (2 0-3 0 

30 copies), and therefore, allows for increased 
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sensitivity. However, carefully controlled 
hybridization conditions are necessary to maintain 
exclusivity with probe set II, since the specificity 
of probe 1864 depends on a single mismatch to 

05 differentiate between Shigella and E. coli which 
harbor the repetitive element (Class III repeat) . 

Table 12 lists the number of isolates expected 
to be detected by probe set I and probe set II in a 
capture/detection format using a (++) hybridization 

10 signal as the cut-off for detection. The results 
for probe sets I and II in a capture/detection 
format are expected to be the same when a ( + ) signal 
is used as the cut-off for detection. 

Additional Probes 

15 Other probes (double- or single-stranded 

nucleic acid fragments or oligonucleotides) , probe 
sets or combinations may be derived from the 
Shigella specific fragments. These fragments or 
oligonucleotides (probes) "derived from the sequence 

20 o'f Shigella specific fragments" , comprise nucleic 
acid sequences which are identical or complementary 
to a portion of the sequence of the Shigella 
specific fragments (and therefore to the Shigella 
chromosome) * In some cases, only a portion of the 

25 probe may be identical to the sequence of the 

original Shigella specific fragment. Portions of a 
probe which are identical or complementary to the 
sequence of a Shigella specific fragment can be 
noncontiguous in the probe. 
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The preferred probes will retain features of 
the inclusivity, and if desired, the exclusivity of 
the Shigella specific fragments. A probe derived 
from a Shigella specific fragment which 

05 "substantially retains the inclusivity behavior of" 
a selected Shigella specific fragment hybridizes, 
under the same conditions, to at least one isolate 
of 90% or more of the (typed) serotypes to which the 
original fragment hybridizes. An original Shigella 

10 specific fragment which hybridizes to at least one 
isolate of one of the 3 5 serotypes listed in Table 1 
is said to hybridize to that serotype. A probe 
which "moderately retains the inclusivity behavior 
of" a selected Shigella specific fragment 

15 hybridizes, under the same conditions, to at least 
one isolate of 83% or more, but less than 90%, of 
the serotypes to which the original fragment 
hybridizes. A probe which "partially retains the 
inclusivity behavior of" a selected Shigella 

20 specific fragment hybridizes, under the same 

conditions, to at least one isolate of 50% or more, 
but less than 83%, of the serotypes to which the 
original fragment hybridizes. 

Exclusivity was determined using two sets of 

25 exclusivity organisms. The exclusivity organisms 
screened included 152 non-EIEC strains listed in 
Tables 6, 7 A and 7B, and defined here as non-EIEC 
Enterobacteriaceae exclusivity organisms. In 
addition, the 91 strains listed in Table 8, comprise 

30 a second set of exclusivity organisms defined here 
as exclusivity organisms commonly found in stool. 
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A probe derived from a Shigella specific 
fragment which has "improved" exclusivity behavior 
for a given set of exclusivity organisms (e,g,, 
non-EIEC Enterobacteriaceae) is one for which all of 
05 the exclusivity organisms of that set have been 

screened in the dot blot format, and which detects 
(hybridizes with a signal of (+) or better) fewer 
exclusivity organisms of that set than the Shigella 
specific fragment from which it is derived, under 
10 the same hybridization conditions. A probe derived ' 
from a Shigella specific fragment which, for a given 
set of exclusivity organisms, "substantially 
retains" the exclusivity behavior of the fragment 
from which it is derived, is one for which 90% or 
15 more of the exclusivity organisms of that set have 
been screened in the dot blot format, and which has 
substantially the same or identical exclusivity 
behavior under the same hybridization conditions. 
In particular, a probe which will detect no more 
20 than 13 strains of a set of exclusivity organisms 

which are not detected by the fragment from which it 
is derived, and which may or may not detect the 
exclusivity organisms which are detected by the 
original fragment, is defined as one which 
2^ "substantially retains" the exclusivity behavior of 
that fragment. It will be appreciated that a probe 
for which exclusivity has been determined for 100% 
of a given set of exclusivity organisms, but which 
detects the same number of exclusivity strains or 
30 more (but not more than 13 additional strains) of 
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the exclusivity organisms as the original fragment 
falls in this latter category. 

Furthermore, a probe for which exclusivity has 
not been determined for 100% of the organisms may be 

05 shown to have "improved" exclusivity behavior. For 
example, probes 1684 (SEQ ID NO:25) and 1685 (SEQ ID 
NO: 26), for which the exclusivity for 4 non-EIEC 
Enterobacteriaceae has not been determined, do not 
detect two strains in Table 6 which are detected by 

10 NT19-2 (SEQ ID NO: 11). If it is determined that 
these probes do not detect three or four of the 
strains not tested, then they will have improved 
exclusivity behavior, although they are presently 
classified as substantial . y retaining the 

15 exclusivity of NT19-2. Thus, the two 

classifications are not mutually exclusive. 

Homologues of fragments and oligonucleotides 
derived from Shigella specific fragments, which 
hybridize to substantially the same serotypes as the 

20 fragments and oligonucleotides derived from Shigella 
specific fragments under the same hybridization 
conditions, can also be used. Homologues of a 
sequence fragment or oligonucleotide derived from a 
Shigella specific sequence will be identical or 

25 complementary to all or part of a variant of the 
sequence of a Shigella specific fragment. 

For example, oligonucleotide probes, typically 
from about 10 nucleotides in length up to about 50 
nucleotides in length, comprising sequences 

30 identical to a portion of a Shigella specific 
fragment can be designed. However, an 
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oligonucleotide probe may be longer than 50 
nucleotides. Larger fragments comprising a sequence 
identical to a portion of a full length fragment can 
be prepared by restriction digestion of an isolated 
05 clone, exonuclease digestion, by the polymerase 
chain reaction using selected primers, or other 
suitable methods, for example. 

The additional probes derived from the Shigella 
specific fragments, complements or homologues 
10 thereof, can be screened using a dot blot format 

(such as the cyto-dot or DNA dot blot formats of the 
Examples) . These additional fragments or 
oligonucleotide probes can be used alone or in 
various probe pairs or probe combinations, or in 
15 addition to a selected probe, probe pair or 

combination from Table 9. The additional probes can 
also be used as alternatives to the probes listed in 
Table 9. For example, another probe derived from 
Shigella specific fragment NT11-2 (SEQ ID NO: 2) 
20 could be used together with probe 1682 (SEQ ID 

NO:17) in place of probe 1683 (SEQ ID NO:18). For 
use in a capture/detection format, the probe would 
be derived from the same strand of the Shigella 
specific fragments as probe 1682. A probe which 
25 substantially retained the inclusivity pattern of 
NT11-2 could be selected, for example. 
Furthermore, probes of the present invention can be 
used in combination with other probes for Shigella, 
enteroinvasive E. coli , or other organisms (e.g., 
30 Salmonella, Campylobacter, etc.). 
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It will be appreciated that the signal for the 
recommended probe sets may be increased by using 
additional probes. Additional probes may be 
selected from the probes listed in Table 9, 

°5 oligonucleotide probes derived from the large 
Shigella specific fragments disclosed, the 
homologues and complements of any of the foregoing, 
or other suitable probes. Although some probes are 
preferred as detection probes in a capture-detection 

10 probe format due to hybridization with exclusivity 
organisms, each probe may be used as either a 
capture or detection probe. 

Inclusivity and Exclusivity Patterns 

Different inclusivity and exclusivity patterns 

15 can be obtained using selected combinations of 

probes. Furthermore,, inclusivity and exclusivity 
behavior may be modulated by hybridization 
conditions, and/or by taking a specific level of 
hybridization as the cut-off. For example, in Table * 

20 10, a (+) signal, which is a weak but reproducible 
and readily detectable signal, is used as the 
cut-off for inclusivity or detection in the dot blot 
format (cyto-dot or DNA dot format) . 

In Table 11, a (++) cut-off is used. In Table 

- 5 li, the number of isolates of each serotype or 
untyped isolate to which the probes NT6, NT11-2, 
NT18-la, NT19-2, hybridized with a signal of at 
least (++) is indicated. In addition, the expected 
number of isolates of each serotype or untyped 

^ isolates to which probe pairs selected from 
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1911/1500/1501, 1682/1683, 1708/1709, 1712/1713, 
1684/1685, or 1706/1707 are expected to hybridize 
with a signal of at least (++) in a 
capture/detection format are also indicated. (ND 
indicates that hybridzation was not determined for a 
particular isolate, ) 

Unless indicated otherwise herein, a probe 
which "detects" or for "the detection" of an isolate 
or serotype is one wh:nh gives at least a ( + ) signal 
under the hybridization conditions used with the 
isolate or with an isolate of a specific serotype. 
An individual probe (fragments or oligonucleotides) , 
probe oair or probe set (a combination of probes 
and/or probe pairs) which, under the conditions used 
in the dot blot format, detects (hybridizes with a 
signal of at least + ) at least one isolate of 90% or 
more of the serotypes listed in Table 1 is defined 
as a "substantially inclusive" probe, probe pair, or 
probe set. Similarly, a probe, probe pair, or probe 
set which, under the conditions used in the dot blot 
format, detects at least one isolate of 83% or more, 
but less than 90%, of the serotypes listed in Table 
1 is a moderately inclusive probe, probe pair or 
probe set. A probe, probe pair or probe set which, 
under the conditions used in the dot blot format, 
detects at least one isolate of 50% or more, but 
less than 83%, of the serotypes listed in Table l is 
a partially inclusive probe, probe pair or probe 
set. 

For example, probe sets I and II described 
above are substantially inclusive probe sets. Based 
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on the dot blot data (see summary Table 10) , these 
combinations are expected to detect at least one 
member of 3 3 and 34 out of 3 5 serotypes, 
respectively in a capture/detection format (94.2% 
05 and 97.1%). In fact, each of these probe sets is 
expected to detect every isolate tested of the 
serotypes detected. 

Individual probes such as oligonucleotides 
1900, 1500 or 1501, which detect at least one member 
10 of 60% of the serotypes listed in Table 1, or 

fragment NT-6, which detects approximately 68% of 
the serotypes listed in Table 1 (see Table 10) , 
would be considered partially inclusive probes. 
Probe 1864 is a substantially inclusive probe. 
15 Partially, moderately and substantially inclusive 
probes may be combined with each other or with other 
probes into appropriate pairs or sets to achieve a 
desired inclusivity pattern. 

As stated above, the exclusivity organisms 
20 screened include the 152 non-EIEC strains listed in 
Tables 6, 1 A and 7B, and defined here as non-EIEC 
Enterobacteriaceae exclusivity organisms. In 
addition, the 91 strains listed in Table 8, define a 
second set of exclusivity organisms defined here as 
25 exclusivity organisms commonly found in stool. An 
individual probe (fragments or oligonucleotides) , 
probe pair or probe set (a combination of probes 
and/or probe pairs) which, under the conditions used 
in the dot blot format, does not detect (hybridizes 
30 with a signal less than +) any of the 152 non-EIEC 
strains listed in Tables 6, 7A and 7B is defined as 
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an "exclusive 11 probe, probe pair or probe set with 
regard to the non-EIEC Enterobacteriaceae 
exclusivity organisms. A probe, probe pair, or 
probe set which, under the conditions used in the 
Tables, detects (hybridizes with a signal of (+) or 
better) 10% or less of the 152 non-EIEC strains 
listed in Tables 6 and 7A and B is defined as a 
"substantially exclusive" probe, probe pair or probe 
set with regard to these E. coli and 
Enterobacteriaciae exclusivity organisms, while a 
probe which detects 20% or less of the 152 non-EIEC 
strains listed in Tables 6 and 7A and B is defined 
as "moderately exclusive" of the non-EIEC 
Enterobacteriaceae exclusivity organisms, A probe 
which is "exclusive" also meets the criteria for a 
moderately or substantially exclusive probe. 

A probe, probe pair or probe set which, under 
the conditions used in the dot blot format, does not 
detect (hybridizes with a signal less than +) any of 
the 91 strains commonly found in stool and listed in 
Table 8 is defined as an "exclusive 11 probe, probe 
pair or probe set with regard to the exclusivity 
organisms commonly found in stool. A probe, probe 
pair, or probe set which, under the conditions used 
in the Tables, detects (hybridizes with a signal of 
(+) or better) to 10% or less of the 91 strains 
listed in Table 8 is defined as "substantially 
exclusive" of the exclusivity organisms commonly 
found in stool. Thus, a probe that is exclusive of 
the organisms in Table 8 is also substantially 
exclusive of the same organisms. 
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For example, probes 1500, 1501 or 1911 are 
exclusive of the non-EIEC Enterobacteriaceae of 
Tables 6 and 7, as well as of the strains commonly 
found in stool. Probes 1684 and 1685 did not detect 
05 any of the non-EIEC tested; however, 4 organisms 
(2.6%) were not tested. Thus, 1684 and 1685 are 
known to be substantially exclusive. If none of 
these 4 organisms is detected, then these probes 
would be exclusive. Probe set I, when used in a 

10 capture/detection format, is expected to detect 2 of 
the strains from Table 7 ( Escherichia fergusonii ) . 
However, four strains from Table 7 were not tested 
(ND) . Therefore, probe set I detects at most 3.9% 
(6/152) of the non-EIEC Enterobacteriaceae, and is 

15 thus "substantially exclusive" of the non-EIEC 

Enterobacteriaceae exclusivity organisms. Probe set 
I is also exclusive of the exclusivity strains 
commonly found in stool (listed in Table 8) . 
Probe 437 detects about 9.9% (15) of the 

2 0 non-EIEC organisms; however, 10/152 of the non-EIEC 
Enterobacteriaceae were not screened. Thus, the 
exclusivity of this probe for the non-EIEC 
Enterobacteriaceae is between 16.4% (moderately 
exclusive) and 9.9% (substantially exclusive). 

25 However, in a capture/detection format, probe set 
II, which may use the complement of 437, is 
substantially exclusive of the non-EIEC 
Enterobacteriaceae exclusivity organisms and 
exclusive of the exlusivity strains commonly found 

30 in stool due to the behavior of probe 18 64. The 
expected behaviour of probe pairs of probe sets I 
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and II in a capture detection format with respect to 
EIEC, non-EIEC Enterobacteriaceae, and strains 
commonly found in stool is summarized in Table 13, 
in which a (+) signal is used as the cut-off for 
05 detection. (In Table 13, as in Tables 11 and 12, 
any pair selected from probes 1500, 1501 and 1911 
displays the behavior indicated.) 

Assay Formats for Probes 

Probes, probe pairs, and probe sets can be used 

1° in a variety of hybridization assay formats. Such 
hybridization assays include solution hybridization 
assays in which the sequences to be detected and the 
probes are free in solution, or assays in which one 
of the sequence or probe is fixed to a solid 

15 support. Shigella specific fragments, portions 
thereof, oligonucleotide probes derived from the 
fragments, complements, or homologues can be used in 
dot blot formats or other appropriate 
hybridization-based assay formats. For example, the 

20 large fragments or portions thereof can be prepared 
as probes by nick translation or other suitable 
methods for filter hybridization (see e.g., U.S. 
Patent 4,358,535, Falkow et al . ) . 

The probes can be used in suitable capture 

25 detection assay formats (see e.g., D.V. Morrissey, 
et al . , Analytical Biochemi stry, 181: 345-359 (1989) ; 
W.R. Hunsaker, et al . , Analy tica l Biochemistry , 
181 : 360-370 (1989); H. Lomel.i, et al . , Clinical 
Chemistry , 35:1826-1831 (1989); Pritchard, C.G. and 

30 J. E. Stefano, Ann. Biol, clin. 48:492-497 (1990)). 
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In a capture/detector format, the probe pairs are 
preferably selected from non-overlapping portions of 
the same strand (a selected strand) of a Shigella 
specific fragment or a variant of a Shigella 

05 specific fragment. The probes can be separated by a 
distance consistent with activity in the selected 
assay. Thus, in- a standard capture /detect ion 
format, the probes should be close enough that 
sample preparation does not separate the 

10 complementary sequences to the extent that the 
desired sensitivity of detection is compromised. 

RNA probes may also be prepared. For example, 
probe nucleotide sequences can be incorporated into 
a fal-st MDV cDNA construct, and transcribed from 

15 linearized plasmid using T7 RNA polymerase. A 
detection probe prepared in this way can be used 
with one or more capture probes and amplified in Q/3 
replicase system (Pritchard, C.G. and J.E. Stefano, 
Ann- Biol. Clin. 48: 492-497 (1990))* 

20 The oligonucleotide probes described or others 

based on the sequence of the Shigella specific 
fragments can be used in the polymerase chain 
reaction. A second oligonucleotide can be prepared 
from the opposite strand. 

25 The present invention will now be illustrated 

by the following examples, which are not intended to 
be limiting in any way. 
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EXAMPLES 
Example 1 Cyto-dot Panels 

All cyto-dot panels were prepared by spotting 

8 

approximately 1 X 10 cells of each bacterial 
05 isolate in a 5 /il volume onto nitrocellulose. The 
bacteria were lysed and the DNA was denatured by 
placing the nitrocellulose filters on 3 MM paper 
wetted with 0.5 M NaOH and 1.5 M NaCl for 10 
minutes. Following this treatment, the 
10 nitrocellulose filters were neutralized by placing 
them on 3 MM paper wetted with 1 M Tris pH 7 . 5 and 
1.5 M NaCl for 10 minutes. The latter 
neutralization step was repeated, and the DNA was 
fixed to the filters by baking under vacuum for 
15 1-1.5 hours at 80 °C. 

Example 2 Hybridization Conditions 

The hybridization conditions for all 
nick-translated fragments were as follows: 
Prehybridization~was in 10X Denhardt's, 5X SET, 0.1 

20 M phosphate buffer pH 7 , 0.1% sodium pyrophosphate, 
0.1% SDS for 3 hours at 65 °C. (Note that 20X SET 
is 3M NaCl, 0.4M Tris-HCl pH 7.5 and 20mM EDTA) . 
Hybridization — was in 2X Denhardt's, 5X SET, 0.1 M 
phosphate buffer pH 7 , 0.1% sodium pyrophosphate, 

25 0.1% SDS and 1 X 10 6 counts of nick-translated probe 
per ml of hybridization solution. Hybridizations 
occurred overnight at 65 °c. The autoradiographs 
were exposed for 15 hours. 

The hybridization conditions for all kinased 

30 oligonucleotides (except probe 1864, a 17 base (b) 
oligonucleotide) were as follows: 
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Prehybridization — was in 5X Denhardt's, 6X SET, 0,1 
M phosphate buffer pH 7, 0.1% sodium pyrophosphate, 
0.1% SDS for 3 hours at 60 °C. Hybridization — was 
in IX Denhardt's, 6X SET, 0.1 M phosphate buffer pH 

05 7, 0.1% sodium pyrophosphate, 0.1% SDS and 1 X 10 6 
counts per minute of kinased oligonucleotide probe 
per ml of hybridization solution. Hybridizations 
occurred overnight at 60 °C- The autoradiographs 
were exposed for 15 hours or 7 days. The data 

10 recorded in Tables 2-8 are from 7 day exposures 

The results of the two exposures were similar. The 
hybridization conditions for probe 1864 (17 b) were 
identical to those above except that the prehybrid- 
ization, hybridization and wash temperatures were 50 

15 °C rather than 60 °c. 
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TABLE 1 

SIMPLIFIED OUTLINE OF SHIGELLA CLASSIFICATION 

SPECIES SEROLOGICAL SEROLOGICAL 
SUBGROUP TYPE (S) 

A i through 10 

1 through 6 
C 1 through 18 

D 1 



5 S. dysenteriae 
S. flexneri 
S . boydii 
S. sonnei 
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TABLE 6A : HYBRIDIZATION OF SHIGELLA PROBES TO E. COLI 



— — 1911 

strain NT 1500 NT 
Genus species ID 6 1501 11-2 1682 1683 1708 1709 



Enteroinvasive 
E. coli 



it 3138 ++++ ++++ - 

» 3145 ++++ ++++ +++ + +/- +++ ++++ 

it 3146 ++ ++++ - 

ii 3157 ++++ ++++ - 

it 3037 ++++ ++++ - 

Enterotoxigenic 
E. coli 

■t 3H8 - - 

» 3119 - - 

ii 3120 - - 

it 3123 - " 

it 3127 - ----- 

i, 312 9 - " 

" 3132 - ~ 

■< 3134 - " 

•> 3136 + " 

" 3142 - " 

" 3147 - " 

« 3151 - - 

ti 3154 - - - ND - - ~ 

» 3156 - - 

it 3158 + " 

it 3160 - " 
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1911 

Strain NT 1500 NT 
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TABLE 6 A : CONT'D 



1911 

Strain NT 1500 NT 
Genus species ID 6 1501 11-2 1682 1683 1708 1709 



Enteropathogenic 

E. coli non - 0157 serotype 

ii 846 ++++ 

» 847 

ii 848 

ii 3149 



Non-pathogenic 
E. coli 



» 3116 

» 3117 

» 3121 
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" 3126 

« 3128 

« 3130 

» 3131 

" 3133 

tt 3135 

" 3139 
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ti 3150 

tt 3153 

" 3158 

it 3161 
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TABLE 6B : HYBRIDIZATION OF SHIGELLA PROBES TO E. COLI 



Genus Strain NT 1712 1713 NT 1684 ompA CR3 

species ID 18-la 19-2 1685 1707 1706 437 1864 



Enteroinvas ive 
E. coli 

3138 +/----- ++++ 

3145 - - - +/- - - - + +++ 

" 3146 - - - ++++ ++++ + +++ 
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TABLE 6B : CONT'D 



Genus Strain NT 1712 1713 NT 1684 ompA CR3 

species ID 18-la 19-2 1685 1707 1706 437 1864 



Enteropathogenic 

E. coli - 0157 serotype 



It 
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Enteropathogenic 
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3039 +++ ++++ ++++ +/- - - _ + - 

3041 --- + -- - ++ND 

3042 - - - +/- ----- 

3043 - - - +/- - 

839 - +/- ----- 

840 - - - +/- ++ 

841 - - - +/- - 

842 - - - +/- - - -/ + 

843 - - - +/- _ 

844 - +/- - - - - -/ + 

845 - - - +/- + 
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Equivalents 

Those skilled in the art will recognize or be 
able to ascertain , using no more than routine 
experimentation, many equivalents to the specific 
05 embodiments of the invention described herein. Such 
equivalents are intended to be encompassed within 
the scope of this invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Parodos, Kyriaki 
Mccarty, Janice 

(ii) TITLE OF INVENTION: Nucleic Acid Probes for the Detection of 
Shigella 

(iii) NUMBER OF SEQUENCES; 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Amoco Corporation 

(B) STREET: 200 East Randolph Drive, P.O. Box 87703 

(C) CITY: Chicago 

(D) STATE i Illinois 

(E) COUNTRY: U.S.A. 

(F) ZIP: 60680 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US92/06617 

(B) FILING DATE: 28-JUL-92 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/738,800 

(B) FILING DATE: 31-JUL-1991 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Galloway, Norval B. 
(B> REGISTRATION NUMBER: 33,595 

(C) REFERENCE/DOCKET NUMBER: GTR90-04 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-856-7180 

(B) TELEFAX: 312-856-4972 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D ) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AATCCAACCG CAGTAATAAA CTGAATCCCT CGCATGGCTT GCAGCGCCTC TACTACCGGA 60 

TACAGCCTCC ATTCGGTAAC NGCCTCCTTG AGGGCGGATT CCAGCCGTTC ACATTGTGCC 120 

TGCCGATCTT CTATTGTACG ACGGTGTTCG TCAAAAGCTA ATTG 164 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



CATCAGAAAT 


CTAAGCAGAA 


GTCTTTCCGT 


GTTTCTCAGA 


AATGGGGGCA 


ACGTGCAAAA 


60 


CTTGCCCTTG 


CTGGTGAACA 


ACGTCTTACA 


AAGATGGTTC 


CTGGATGGAT 


TGACCCTGAG 


120 


ACTTTTAAAC 


TCAATGAACA 


CGCTGAGACT 


GTGAGATTGA 


TATTCAAACT 


GCTGCTAGAT 


180 


GGTGAAAGTC 


TGCATAACAT 


TGCACGTCAC 


CTTCAAAGCA 


ACGGTATAAA 


GTCGTTTAGT 


240 


CGCCGTAAAG 


ATGCTAATGG 


GTTCTCTGTT 


CACTCTGTAC 


GCACATTCTA 


AGGTCAGAGC 


300 


AACAATAGGC 


ACGTTACCAG 


CATCACAACG 


TAATGACCGC 


CCCGCTATAC 


CGAACTACTA 


360 


CGAAGGTGTT 


GTAGATATAC 


CAACGTTCAA 


TAAAGCTCAA 


GAGATTCTCG 


ACAAGAATCG 


420 


TAAAGGCCGT 


AC ACCTG CAA 


GTGACAACCC 


ACTAACGATT 


AACATCTTCA 


AAGGTCTGTT 


480 


TAGGTGTCAG 


TGTGGGGCTA 


GTGTCCATCC 


TACCGGAACA 


AAGAATAAGT 


ATGCTGGGGT 


540 


CTACAGGTGC 


AATAACCACT 


TAGACGGTCG 


CTGTGATGTT 


CCACCGTTGA 


AGCGTAAACC 


600 


GTTTGACCGA 


TGGATGATTG 


ATAATTTTCT 


GGGGATGATT 


GACGTGGGGA 


ATGATGGAGA 


660 


ATCAGAGAGA 


AAGATTGCAG 


CGTTACAGCA 


TGAGGTTGAA 


ATTGTCACAG 


CCAGAATCAA 


720 


GAAACGTACC 


GCCCTACTTC 


TTGAGATGGA 


TGATATTGAT 


GAACTAAAAA 


TTCAGCTTAA 


780 


GGAACTGAAC 


CAGAAG 










796 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GATCTTTCTT CGAAGAAGAG AGCGCACCAA TACCCGCGCC CACGAGAGAG CCCAGACCTG 60 

CGCCGATAGC AGATTTGCCT GCTTCGCGTT CGCCGGTGTA AGGGTTAGTT GTGCAGCCAG 120 

ATACCGCCAG AGCGCCACTC ACTACGGAGG CAATAAGATA AACACGTTTG TTCATTGTTA 180 

ATCCTTCCTA ACCTTTTTAT TCTTTGCCAC GGGTTCCGTG GCGGGAGATT ATGCCGCGTG 240 

AACATGAAGA TGAGGTGTAC TGGCAATAGC GGACACTACC ATTTGTTCTT TTTTTAAGCA 300 
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GCCATCTGAT GATATTTTTC CCTGAAGGCT GCCGGGGAGA TATTCCCCAG ACGAGAGTGA 360 

CGACGCTGAC GATTCTAGAA AATCTCAATG TATTCCCGTA TTACTGAGAT GGCTTCATCC 420 

CGGTTATTAA AACGATAGTG GCTCAGGCTC TCATTTTTCA GCGTTCCCCA CAAGCTTTCC 480 

ATCGGAGCGT TGTCGTAACA GTTACCTTTA CGCGACATTG ATGTTTTCAG ACCAGACTGC 540 

TCCTGTATGA CCCGGTAATC GTATGCGCAG TACTGTGAAC CTCGATC 587 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GATCAGAGTG GTGGATTAGC CCGGCAGTGG GCGCTGGCTC CTGAGCGCCA TAAACAGGGC 60 

TTTACCTGTC AGCTCTTTTG TCATGCGCTC TCCCATGGCG TACGACAATT TCGCACGTAT 120 

AAACATCTTT GATGCCAGCG AGGTACAACC ATCCCTCCTG TGTGGCAACA TACGTCAGGT 180 

CCGCCACCCA GACCTGATTT GGTGCTGTAG GAGCGAACGT CTGGTTCAGC AGATTTGGCG 240 

CAACTGGCAG ATTGTGGTTC GAGTTCGTAG TCGCTCTGAA CTTGCGTTCT GCTTACAGCG 300 

TAGCTTAGCT CCTTACGAAG ACGTGCCAGT CGGTCACGAC CAACGATGAT GCCATTCTCT 360 

GCCAGCTCCG TCTGGGAGCC GCCGGGTTTC CATATGTTTC GCGAGTGCGG ATATGTGCCA 420 

CCTTAATCTC CAGTTTTAGC CGCTCATCAC TTTGTTTTCT GTCTGAGGGT TCATGCTGTA 480 

CCCAGTTGTA ATAACCGCTC CTGGATACAC CAAATACTGA CACATCGCTT CAATGGGAAA 540 

TTGTTGTCGC CATTGTTCGA TTAACGCGTA TTTTTCAGCG ACTCCTGTGC AAAATACGCT 600 

GTTGCTTTTT TTAATATATC TCGCTCAAGG CGAGCTTCAT TTAACGCCTT ACGCAGTCGC 660 

AGAATTTCAG ATTCCAGTTC AGCCACCGTG CGGGAACCAG GAGTACCGAG CCCTTTTCTG 720 

GCGGCGGTAA CCCATTGTCC TAAAGTGCCT TCAGGAAGAG ATAATCGGGA AGCGCCTTCA 780 
CTGATC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: linear 



786 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



AGTTGGACTA 


TAGGTGTACT 


GGCAATACGG 


ACACACCATT 


TGTTCTTTTT 


TTAAGCAGCA 


60 


TCTGATGATA 


TTTTTCCCTG 


AAGGCTGCCG 


GGGAGATATT 


CCCCAGACGA 


GAGTGACGAC 


120 


GCTGACGATT 


GTAGAAAATC 


TCAATGTATT 


CCCGTATTAC 


TGAGATGGCT 


TCATCCCGGT 


180 


TATTAAAACG 


ATAGTGGCTC 


AGGCTCTHAT 


TTTTCAGCGT 


TCCCCANAAG 


CTTTCCATCG 


240 


GAGCGTTGTC 


GTAACAGTTA 


CCTTTACGCG 


ACATTGATGT 


TTTCAGACCA 


AACTGCTCCT 


300 


GTATGACCCG 


GTAATCGTAT 


GCGCAGTACT 


GTGAACCTCG 


ATCAGAGTGG 


TGGATTAGCC 


360 


CGGCAGGNGG 


GCGCTGGCTC 


CTGAGCGCCA 


TAAACAGGGC 


TTTACCTGTC 


AGCTCTTTTG 


420 


TCATGCGCTC 


TCCCATGCGT 


AGCCGACAAT 


TTCGCACGTA 


TAACATCTTT 


GATGCCAGCG 


480 


AGGTACACCA 


TCCCTCCTGT 


GTGGCANCAT 


ACGTCAGGTC 


CGCCACCCAG 


ACCTGATTTG 


540 


GTGCTGTAGG 


AGTGAACGTC 


TGGTTCAGCA 


GATTTGGCGC 


AACTGGCAGA 


TTGTGGTTCG 


600 


GGTTCGTAGT 


CGCTCTGAAC 


TTGCGTTTCT 


GCTTACAGCG 


TASCTTAGCT 


CCTTACGAAG 


660 


ACGTGCCAGT 


CGGTCACGCC 


AACGATGATG 


CCATTCTCTG 


CCAGCTCCGT 


CTGGAGCCGC 


720 


CGGGTTCCAT 


ATGTTDCGCR 


AGTGCGGATA 


TGTGCCACCT 


x ns\ x \+ x k*v*nw 




"Tftft 
/ OU 


TCATCACTTT 


GTTDTCTGTC 


TGAGGGTTCA 


TGCTGTACCC 


AGTTGTAATA 


ACCGCTCCTG 


840 


GATACACCAA 


ATACCTGACA 


CATCGCTTSA 


ATDDDAAATT 


GTTGTCGCCA 


TTGTTCGATT 


900 


AACGCGNNNN 


NNNCAGCGAC 


TCCTGTGCAA 


AATACGCTGT 


TGCTTTTTTT 


AATATATCTC 


960 


GCTCAAGGCG 


AGCTTCATTT 


AACGCTTTAC 


GCAGTTGCAG 


AATTTCAGAT 


TCCAGTTCAG 


1020 


CCACCGTGCG 


GGAACCAGGA 


GTACCGAGCC 


CTTTTCTGGC 


GGCGGTAACC 


CATTGTCCTA 


1080 


AAGTGCCTTC 


AGGAAGAGAT 


AATCGGGAAG 


CGCCTTCGCT 


GATCGAAAGT 


TGATTTTCAA 


1140 


GAACCGTTCT 


GACAGCTTCG 


GCTTTGAACT 


CTGT 






1174 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TGTGGCATCA ACAATGGTGC GACCACCGAG CGAGATGAGG TGTACTGGCA ATAGCGGACA 60 
CAAC 64 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATTGTAGAAA 


ATCTCAATGT 


ATTCCCGxAx 


XAUlbnuniu 


CPTTPATCCC 


GGTTATTAAA 

ww X 1 ** 1 1«W» 


60 


ACGATAGTGG 


CTCAGGCTU 1 


riAX LI X lunu 


X X ^twwui 


AAGCTTTCCA 

AAU will W\«r** 


TCGGAGCGTT 


120 


GTCGTAACAG 


TTACuXiiAu 


r*n a ra TTfs a 

VjwwAUaX Xurit 


TflTTTTPAGA 
lu x x x x <— rvvjrv 


CCAAACTGCT 


CCTGTATGAC 


180 


CCGGTAATCG 


lAluCGUiu J. 


& PT^Tf; a a f*r* 

X\^-> lUl Vj/U\V# W- 


t cn. a t pa G A G 


TGGTGGATTA 


GCCCGGCAGG 


240 


NGGGCGCTUU 


wxUwxv7iiw<~w* 




V7wwl X X AWWX 


GTCAGCTCTT 


TTGTCATGCG 


300 


CTCTCCCATG 


CGXAwCC&AL. 


AA1 X 1 UwUnU 




TTTGATGCCA 
x x l w*i x vjc»wn 


GCGAGGTACA 


360 


CCATCCCxUu 


ltflvluutADI 




fi^^fJPPAPP 


CAGACCTGAT 


TTGGTGCTGT 


420 


AGGAGTGAAC 


GTCTGGTTCA 


GCAGATTTGG 


CGtVjCTGGC 


AGATTGTGGT 


TCGGGTTCGT 


480 


AGTCGCTCTG 


AACTTGCGTT 


TCTGCTTACA 


GCGTASCTTA 


GCTCCTTACG 


AAGACGTGCC 


540 


AGTCGGTCAC 


GCCAACGATG 


ATGCCATTCT 


CTGCCAGCTC 


CGTCTGGAGC 


CGCCGGGTTC 


600 


CATATGTTDN 


GCRAGTGCGG 


ATATGTGCCA 


CCTTAATCTC 


CAGTTTDAGC 


CGCTCATCAC 


660 


TTTGTTDTCT 


GTCTGAGGGT 


TCATGCTGTA 


CCCAGTTGTA 


ATAACCGCTC 


CTGGATACAC 


720 


CAAATACCTG 


ACACATCGCT 


TNAATGGGAA 


ATTGTTGTCG 


CCATTGTTCG 


ATTAACGCGN 


780 


ATTTTTCAGC 


GACTCCTGTG 


CAAAATACGC 


TGTTGCTTTT 


TTTNATATAT 


CTCGCTCAAG 


840 


GCGAGCTTCA 


TTTAACGCCT 


TACGCAGTTG 


CAGAATTTCA 


GATTCCAGTT 


CAGCCACCGT 


900 


GCGGGAACCA 


GGAGTACCGA 


GCCCTTTTCT 


GGCGGCGGTA 


ACCCATTGTC 


CTAAAGTGCC 


960 


TTCAGGAAGA 


GNTAATCGGG 


AAGCGCCTTC 


GCTGATCGAA 


AGTTGATTTT 


CAAGAACCGT 


1020 


TCTGACAGCT 


TCGGCTTTGA 


ACTCTTTAGA 


GTAACGTTGG 


TTTTTTCTGC 


TCATTATTAG 


1080 


CTCCTTCTGA 


TGCCATTCTA 


TTTCAGGAAG 


GAGTGTCCGT 


TAAACTCAGG 


CTACCTCAAG 


1140 


ATAAAGTTAT 


TAATTTCGAA 


GATCACATCT 


TCAATAGGTT 


TGCGGTCCAT 


ATTATC 


1196 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
AAAGCACAGA TTTTATAGCT AACTCGATGC TGGTGTGAGG TGTACTGGCA ATAGCGGACA 60 



CTAC 

(2) INFORMATION FOR SEQ ID NO: 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



64 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



ATTGTAGAAA 


nlUi Iv/IA X\J X 


Al lUULljJIAl 


1 AC 1 GAG ATG 


GCTTCATCCC 


GGTTATTAAA 


60 


ACGATAGTGG 


\* x Unvu ^i^i, 


flA 11111 C AG 


CGI 1CCCCAN 


AAGCTTTCCA 


TCGGAGCGTT 


120 


w x X nn^nij 


1 1ACC1 1 InU 


G CG AC ATTRA 


TG TTTTCAG A 


CCAGACTRCT 


CCTGTATGAC 


180 


^-v*v*« inni i«w 




AC iGTG AACC 


T CG ATC AG AG 


TGGTGGATTA 


GCCCGGCAGG 


240 


NGGGCGCTGG 


CTCCTGAGCG 


CCATAAACAG 


GGCAAATCCT 


GTCAGCTCTW 


TTGTCATGCG 


300 


CTCTCCCATG 


CGTAGCCGAC 


TAATTTCGCA 


CGTATAACAT 


CTTTGATGCC 


AGCGAGGTAC 


360 


ACCATCCCTC 


CTGTGTGGCA 


NCATACGTCA 


GGTCCGCCAC 


CCAGACCTGA 


TTTGGTGCTG 


420 


TAGGAGCGAA 


CGTCTGGTTC 


AGCAGATTTG 


GCGCAACTGG 


CAGATTGTGG 


TTCGAGTTCG 


480 


TAGTCGCTCT 


GAACTTGCGT 


TTCTGCTTAC 


AGTGTAGCCT 


TAGCTCCTTA 


CGAAGACGTG 


540 


CCAGTCGGTC 


ACGCCAACGA 


TGATGCCATT 


CTCTGCCAGC 


TCCGTCTGGA 


GCCGCCGGGT 


600 


TCCATATGTT 


NCGCRAGTGC 


GGATATGTGC 


CACCTTAATC 


TCCAGTTTDA 


GCCGCTCATC 


660 


ACTTTGTTDT 


CTGTCTGAGG 


GTTCATGCTG 


TACCCAGTTG 


TAATAACCGC 


TCCTGGATAC 


720 


ACCAAATACC 


TGACACATCG 


CTTSAATGGG 


AAATTGTTRT 


CGCCATTGTT 


CGATTAACGC 


780 


GACTCCTCTG 


CAAAATACGC 


TGTTGCTTTT 


TTNNATATAT 


CTCGCTCAAG 


GCGAGCTTCA 


840 


TTTAACGCCT 


TACGCAGTTG 


CAGAATTTCA 


GATTCCAGTT 


CAGCCACCGT 


GCGGGAACCA 


900 


GGAGTACCGA 


GCCCTTTTCT 


GGCGGCGGTA 


ACCCATTGTC 


CTAAAGTGCC 


TTCAGGAAGA 


960 


GATAATCGGG 


AAGCGCCTTC 


ACTGATCGAA 


AGTTGATTTT 


CAGGAACCGT 


TCTGACAGCT 


1020 


TCGGCTTTGA 


ACTCTTKAGA 


GTAACGTTGG 


GTTTTTCTGC 


TCATTATTAG 


CTCCTTCTGA 


1080 


TGCCATTCTA 


TTTCAGGAAG 


GAGTGTCCGT 


TAAACTCAGG 


CTACCTCAGT 


GTGATCGGCG 


1140 


ATAAGCCCAG 


AACTCCGCTC 


CCAGACCTCC 


CTGCCAAAAG 


CAAAACCG 




1188 
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(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATCTCCTTAT GTTATGGAGA TTATTAAAAA GAATAACATT AGCGCTCTCG AACTGCATCG 60 

TGCAATTGTT GAGTTGAGTA AAAATATGAA GTCGATTGAT GATAATGCCA GTAAGAAAAA 120 

CGACAAGTCA TCATTGTATG TATCATGGAC TCTGAGTTTT ACTGCTCCAA CAAGTAAAGA 180 

AGCTCACGAT GTGTTGTCTG GGTATATTAA TTATGTTTCT TCCCTTGTTG TAAGGGATTT 240 

GATGGAAGAT ATAAGAAATA AACTAGAAGT TAAAACTAAT GTTGAAAAAG AAATTCTTGC 300 

ACTGGATGAG ATAAAAATTA GAAACCAGCT GAATGCAGAT ATTCGACNCC TCAATTATTC 360 

ACTGGAGGTT GCTAATGCGG CTGGAATAAA AAAACCTGTA TACAGCAATG GTCAGATTAT 420 

GAAGGATGAC CCAGATTTTC CTGTGGCTCT CGGTTCTGAT GGTATAGCAA CTAAATTGAA 480 

CATCAAAAAA TCAATCAAGG ATGTTTCGGA ATTGAGTGGG GAGTTGCGAA ATCGTCAATA 540 

TGTTGTGAAT CAATTGGTTG TGGCGAAAGN GGGGGANGNN GANNNNANGC MANNNCAGNA 600 

NCAANNGTGC CCAACGNNAC CGGNCAGAAA 630 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCACCACATT GATTGTCTGC CTGAAATAAC ACGAAAGGCA CTGCGTGAAC GCTATGTGGA 60 

ACAGCTGGTG GCTACAGAGA ACAATGTTTC TGAAGTGAAA GCTGTTACCA GAAAAACACG 120 

CAATCCTGAC GCTGTCCAGG CAATCGAAGC ATATCGCGGT TCTCCACAAC TGATGGAAGA 180 

ACGCCTGAAT GCGCTGACCG AAAACCAGCG CTGGGTATCT GAAGCAAGAG CTGCGCTGGT 240 

GGTGGAAGTG CTGAAGCTGG AAAGCGCCGG TAACCCCGGG CGACTGAAAG CCATTAACTT 300 

TCTTGTTGAA AAAGCCCCTA AAGGTGAGCT GCCGGAGCGC CTGCAACAGG CCGCAGTTAA 360 

CGCCAATGCA AAACGTGGCG CTAATCGT 388 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTGACGACCT GGACGTGTAC ACTCGTCTGG GTGGTATGGT TTGGCGTGCA GACACCAAAG 60 

CTCACAACAA TGTGACAGGT GAATCTGAGA AAAACCACGA TACCGGCGTT TCTCCGGTAT 120 

TCGCAGGTGG TGTTGAATGG GCCATCACTC CTGAAATCGC TACCCGTCTG GAATACCAGT 180 

GGAC 184 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(.*) I.2NGTH: 169 base pairs 

TYPE: nucleic acid 
{Z> 5TRANDEDNESS: double 
(I) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CTGACGACCT GGACATCTAC ACTCGTCTGG GTGGCATGGT ATGGCGTGCA GACACTAAAT 60 
CCAACGTTTA TGGTAAAAAC CACGACACCG GCGTTTCTCC GGTCTTCGCT GGCGGTGTTG 120 
AGTACGCGAT CACTCCTGAA ATCGCTACCC GTCTGGAATA CCAGTGGAC 169 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTGCAGCGCC TCTACTACCG GATACAGCCT CCATT 35 
(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCTCCTTCAG GGCGGATTCC AGCCGTTCAC AITGT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D ) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCGATCTTCT ATTGTACGAC GGTGTTCGTC AAAAGCTAAT 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTGGTGAACA ACGTCTTACA AAGATGGTTC CTGGATGGAT T 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGTCTTTCCG TGTTTCTCAG AAATGGGGGC AACGTGCAAA A 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCACCGTTGA AGCGTAAACC GTTTGACCGA TGGAT 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCTGGGGTCT ACAGGTGCAA TAACCACTTA GACGGT 36 
(2) INFORMATION FOR SEQ ID NO: 21: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGATGATGCC ATTCTCTGCC AGCTCCGTCT GGGAGCCGCC GGGTTTCC 48 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGAGCAGTCT GGTCTGA 17 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CCTGTGGCTC TCGGTTCTGA TGGTATAGCA ACTAAAT 37 
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(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CAAGGATGTT TCGGAATTGA GTGGGGAGTT GCGAAAT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CAGGCAATCG AAGCATATCG CGGTTCTCCA CAACT 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TGAATGCGCT GACCGAAAAC CAGCGCTGGG TATCT 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GTGATGGCCC ATTCAACACC ACCTGCGAAT ACCGG 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CTCAGATTCA CCTGTCACAT TGTTGTGAGC TTTGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 base pairs 
( 5 ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
AATCCATCCA GGAACCATCT TTGTAAGACG TTGTTCACCA G 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TTTTGCACGT TGCCCCCATT TCTGAGAAAC ACGGAAAGAC T 
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CLAIMS 

1. An exclusively chromosomal Shigella specific 
fragment identified by subtractive 
hybridization using S. flexneri as "target" DNA 

05 and a complex DNA competitor mix as competitor 

DNA. 

2. A Shigella specific fragment identified by 
subtractive hybridization using S. sonnei as 
"target" DNA and a mixture of non-pathogenic E . 

10 coli YMC DNA and pBR322 vector DNA as 

competitor DNA, 

3. A Shigella specific fragment derived from a 
chromosomal sequence of Shigella sonnei, the 
Shigella specific fragment selected from the 

15 group consisting of NT-6 (nucleotides 1-124 of 

SEQ ID NO:l), NT11-2 (SEQ ID NO:2), NT14 (SEQ 
ID NO: 4) and NT15 (SEQ ID NO: 3). 

4. A Shigella specific fragment derived from a 
chromosomal sequence of Shigella flexneri, the 

20 Shigella specific fragment selected from the 

group consisting of NT18-la (SEQ ID NO: 10) and 
NT19-2 (SEQ ID NO: 11) . 

5. A probe for the detection of a member of the 
genus Shigella wherein the probe is capable of 

25 detecting at least one untyped isolate or at 

least one isolate of a serotype of each of the 
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four species of the genus Shigella selected 
from the group consisting of: 

a) (SEQ ID NO: 1) ; 

b) probes derived from SEQ ID NO:l; 
05 c) NT11-2 (SEQ ID NO:2); 

d) probes derived from NT11-2; and 

e) homologues of (a) through (d) . 

6. A probe of Claim 5 for the detection of a 

member of the genus Shigella wherein the probe 

10 is capable of detecting at least one untyped 

isolate or at least one isolate of a serotype 
of each of the four species of the genus 
Shigella, wherein the probe is selected from 
the group consisting of: 

15 a) 1500 (SEQ ID NO: 14) , 

5 ' -TTGCAGCGCCTCTACTACCGGATACAGCCTCCATT-3 ' ; 

b) 1501 (SEQ ID NO: 15) , 

5 ' -CCTCCTTCAGGGCGGATTCCAGCCGTTCACATTGT-3 ' ; 

c) 1911 (SEQ ID N0:l6) , 

20 5 ' -CCGATCTTCTATTGTACGACGGTGTTCGTCAAA- 

AGCTAAT-3 ' ; 

d) 1682 (SEQ ID NO: 17) , 

5 ' -CTGGTGAACAACGTCTTACAAAGATGGTTCCTG- 
GATGGATT-3 ' ; 
25 e) 1683 (SEQ ID NO: IS) , 

5 ' -AGTCTTTCCGTGTTTCTCAGAAATGGGGGCAAC- 
GTGCAAAA-3 9 ; 

f ) 1708 (SEQ ID NO:19) , 

5 ' -CCACCGTTGAAGCGTAAACCGTTTGACCGATGG- 
30 AT- 3 ' ; and 

g) complements of (a) Through (f ) . 
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7. A probe capable of detecting at least one 

isolate of each of serotypes 1-5 of Shigella 
flexneri selected from the group consisting of: 
a) NT18-la (SEQ ID NO: 10); 
05 b) probes derived from NT18-la; 

c) homologues of (a) and (b) ; 

d) NT19-2 (SEQ ID NO; 11); 

e) probes derived from NT19-2; and 

f ) homologues of (d) and (e) . 

10 8. A probe of Claim 7 capable of detecting of at 
least one isolate of each of serotypes 1-5 of 
Shigella flexneri wherein the probe is selected 
from the group consisting of: 

a) 1712 (SEQ ID NO:23) , 

15 5 ' -CCTGTGGCTCTCGGTTCTGATGGTATAGCAACT- 

AAAT-3 ' ; 

b) 1713 (SEQ ID NO: 24) , 

5 ' -CAAGGATGTTTCGGAATTGAGTGGGGAGTTGCG- 
AAAT-3 ' ; and 
20 c) complements of (a) and (b) . 

9. A probe of Claim 7 (d) through (f) which 

further detects a member of the genus Shigella 
selected from the group consisting of S. boydii 
serotype 5, S. boydii serotype 7, S. boydii 
25 serotype 9, S. boydii serotype 11, S. boydii 

serotype 16, S. boydii serotype 17, and S. 
sonnei serotype 1. 
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10. A probe of Claim 9 selected from the group 
consisting of: 
a) 1684 (SEQ ID NO:25) , 

5 ' -CAGGCAATCGAAGCATATCGCGGTTCTCCACAACT-3 ' ; 
05 b) 1685 (SEQ ID NO: 26) , 

5 ' -TGAATGCGCTGACCGAAAACCAGCGCTGGGTATCT-3 ' ; 
and 

c) complements of (a) and (b) . 



10 



15 



11. An oligonucleotide probe for the detection of 
at least one isolate of Shigella selected from 
the group consisting of dysenteriae 
serotypes 1 and 2, and S. ooydii serotypes 5, 
7, 9, 11, 12, 15, 16, and 17, the 
oligonucleotide probe comprising a sequence 
selected from the group consisting of: 

a) 1707 (SEQ ID N0:28) , 

5 7 -CTCAGATTCACCTGTCACATTGTTGTGAGCTTTGG-3 *' ; 

b) 1706 (SEQ ID NO:27) ; 

5 ' -GTGATGGCCCATTCAACACCACCTGCGAATACCGG-3 ' ; 
20 c) complements of (a) and (b) ; and 

d) homologues of (a) through (c) . 

12. A substantially inclusive oligonucleotide probe 
for the detection of a member of the genus 
Shigella comprising a sequence selected from 

25 the group consisting of: 

a) 437 (SEQ ID NO:21) , 

5- 7 CGATGATGCCATTCTCTGCCAGCTCCGTCTGG- 
GAGCCGCCGGGTTTCC-3 ' ; 

b) 1864 (SEQ ID NO:22) , 

30 5 ' -GGAGCAGTCTGGTCTG A- 3 ' ; 
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c) complements of (a) and (b) ; and 

d) homologues of (a) and (b) . 

13. A substantially inclusive capture/detector 
probe pair for the detection of a member of the 

05 genus Shigella, comprising a substantially 

inclusive capture probe and a detector probe 
wherein the capture probe is selected from the 
group consisting of: 
a) 1864 (SEQ ID NO:22); and 

10 b) complement of 1864; 

the detector probe having an oligonucleotide 
sequence derived from the same strand of the S. 
sonnei sequence comprising fragments NT14 (SEQ 
ID NO: 4) and NT15 (SEQ ID NO: 3) as the capture 

15 probe. 

14. A substantially inclusive capture/detector 
probe pair of Claim 13, the probe pair also 
being exclusive of exclusivity organisms 
commonly found in stool and substantially 

20 exclusive of non-EIEC Enterobacteriaceae. 

15. A capture/detector probe pair of Claim 13 
wherein the capture probe is 18 64 (SEQ ID 

NO: 22) and the detector probe is the complement 
of 437. 

25 16. A capture/detector probe pair of Claim 13 

wherein the capture probe is the complement of 
1864 and the detector probe is probe 437 (SEQ 
ID NO:21) . 
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17. A substantially inclusive probe set for the 
detection of a member of the genus Shigella, 
which further detects EIEC, comprising at least 
three capture/detector probe pairs, the first 
05 pair of probes selected from the group 

consisting of oligonucleotides having sequences 
derived from a selected strand of SEQ ID NO:l 
and homologues thereof, the second pair of 
probes selected from the group consisting of 
10 oligonucleotides having sequences derived from 

a selected strand of fragment NT19-2 (SEQ ID 
NO: 11) and homologues thereof, and the third 
pair of probes selected from the group 
consisting of oligonucleotides having sequences 
15 derived from a selected strand of the S. 

dysenteriae ompA sequence which spans 
nucleotides from position 893 through 1076 (SEQ 
ID NO: 12) and homologues thereof. 

A substantially inclusive probe set of Claim 
17, comprising at least three capture/detector 
probe pairs, the first pair of probes having 
oligonucleotide sequences derived from a 
selected strand of SEQ ID NO:l, the second pair 
of probes having oligonucleotide sequences 
derived from a selected strand of fragment 
NT19-2 (SEQ ID NO: 11), and the third pair of 
probes having oligonucleotide sequences derived 



20 



25 
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from a selected strand of the S. dysenteriae 
ompA sequence which spans nucleotides from 
position 893 through 1076 (SEQ ID NO: 12). 

19. A substantially inclusive probe set of Claim 
05 18, the probe set also being exclusive of 

exclusivity organisms commonly found in stool 
and substantially exclusive of non-EIEC 
Enterobacter iaceae * 

20. A substantially inclusive probe set of Claim 19 
10 wherein the first pair of probes is selected 

from the group consisting of probes 1911 (SEQ 
ID NO: 16) , 1500 (SEQ ID NO: 14) and 1501 (SEQ ID 
NO: 15), the second pair of probes consists of 
probes 1684 (SEQ ID N0:25) and 1685 (SEQ ID 
15 N0:26), and the third pair of probes consists 

of probes 1706 (SEQ ID N0L:27) and 1707 (SEQ ID 
N0:28) . 

21. A substantially inclusive probe set for the 
detection of a member of the genus Shigella, 

20 which further detects EIEC, comprising at least 

three capture/ detector probe pairs, the first 
pair selected from the group consisting of: 

a) 1911 (SEQ ID NO: 16) ; 

b) 1500 (SEQ ID N0:14); and 
25 c) 1501 (SEQ ID NO: 15) ; 

the second pair selected from the group 
consisting of: 

a) 1684 (SEQ ID N0:25); and 
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b) 1685 (SEQ ID NO: 26) ; 

the third pair selected from the group 

consisting of: 

a) 1706 (SEQ ID NO:27); and 
05 b) 1707 (SEQ ID NO:28) . 

22. A substantially inclusive probe set for the 
detection of a member of the genus Shigella, 
which further detects EIEC, comprising at least 
two capture/detector probe pairs, the first 

10 P a i** °f probes selected from the group 

consisting of oligonucleotides having sequences 
derived from a selected strand of the sequence 
comprising S. sonnei fragments NT14 (SEQ ID 
NO: 4) and NT15 (SEQ ID NO: 3) and homologues 

15 thereof, and the second pair of probes selected 

from the group consisting of oligonucleotides 
having sequences derived from a selected strand 
of, the S. dysenteriae ompA sequence which spans 
nucleotides from position 893 through 1076 (SEQ 

20 ID NO: 12) and homologues thereof. 

23. A substantially inclusive probe set of Claim 
22, comprising at least two capture/detector 
probe pairs, the first pair of probes having 
oligonucleotide sequences derived from a 

25 selected strand of the sequence comprising S. 

sonnei fragments NT14 (SEQ ID NO:4) and NT15 
(SEQ ID NO:3), and the second pair of probes 
having oligonucleotide sequences derived from a 
selected strand of the S. dysenteriae ompA 

30 sequence which spans nucleotides from position 

S93 through 1076 (SEQ ID NO:12). 
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24. A substantially inclusive probe set of Claim 
23, the probe set also being exclusive of 
exclusivity organisms commonly found in stool 
and substantially exclusive of non-EIEC 

05 Enterobacter iaceae . 

25. A substantially inclusive probe set of Claim 24 
wherein the first pair of probes consists of 
probes 1864 (SEQ ID NO:22) and the complement 
of 437, and the second pair of probes consists 

10 of probes 1706 (SEQ ID NO: 27) and 1707 (SEQ ID 

NO:28) . 

26. A substantially inclusive probe set for the 
detection of a member of the genus Shigella, 
which further detects EIEC, comprising at least 

15 two capture/ detector probe pairs, wherein the 

first pair is 1864 (SEQ ID NO: 22) and the 
complement of 437 and the second pair is 1706 
(SEQ ID NO:27) and 1707 (SEQ ID NO:28). 

27. An oligonucleotide probe derived from Shigella 
20 specific fragment NT-6 (nucleotides 1-124 of 

SEQ ID NO: 1) which substantially retains the 
inclusivity behavior of NT-6. 

28. An oligonucleotide probe of Claim 27 which has 
improved exclusivity behavior for non-EIEC 

25 Enterobacteriaceae and is exclusive of the 

exclusivity organisms commonly found in stool. 
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29. An oligonucleotide probe of Claim 28 comprising 
a sequence selected from the group consisting 
of: 

a) 1911 (SEQ ID NO: 16) , 

05 5 / -CCGATCTTCTATTGTACGACGGTGTTCGTCAAA- 

AGCTAAT-3 ' ; 

b) 1500 (SEQ ID NO: 14) , 

5 ' -TTGCAGCGCCTCTACTACCGGATACAGCCTCCATT-3 ' ; 

c) 1501 (SEQ ID NO: 15) , 

10 5 ' -CCTCCTTCAGGGCGGATTCCAGCCGTTCACATTGT-3 ' ; 

d) complements of (a) through (c) ; and 

e) homologues of (a) through (d) . 

30. An oligonucleotide probe derived from Shigella 
specific fragment NT11-2 (SEQ ID NO: 2) which 

15 substantially retains the inclusivity behavior 

of NT11-2. 

31. An oligonucleotide probe of Claim 30 which 
substantially retains the exclusivity behavior 
of NT11-2 (SEQ ID NO: 2) towards non-EIEC 

20 Enterobacteriaceae . 
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32. An oligonucleotide probe of Claim 31 comprising 
a sequence selected from the group consisting 
of: 

a) 1682 (SEQ ID NO: 17) , 

5 ' -CTGGTGAACAACGTCTTACAAAGATGGTTCCTG- 
GATGGATT-3 ' ; 

b) complement of 1682 (SEQ ID NO:29), 
5 ' -AATCCATCCAGGAACCATCTTTGTAAGACGTTG- 

TTCACCAG-3'; and 

c) homologues of (a) and (b) . 

An oligonucleotide probe derived from Shigella 
specific fragment NT11-2 (SEQ ID NO: 2) which 
moderately retains the inclusivity behavior of 
NT11-2- 

15 34* An oligonucleotide probe of Claim 33 which 

substantially retains the exclusivity behavior 
of NT11-2 (SEQ ID NO:2) towards non-EIEC 
Enterobacteriaceae . 

35. An oligonucleotide probe of Claim 34 
20 comprising a sequence selected from the group 

consisting of: 
a) 1683 (SEQ ID NO: 18) , 

5 ' -AGTCTTTCCGTGTTTCTCAGAAATGGGGGCAAC- 
GTGCAAAA-3 ' ; 

25 b) complement of 1683 (SEQ ID NO:30), 

5 / -TTTTGCACGTTGCCCCCATTTCTGAGAAACACGG- 
AAAGACT-3 ' ; and 
c) homologues of (a) and (b) . 



05 
10 

33. 
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36. An oligonucleotide probe derived from Shigella 
specific fragment NT11-2 (SEQ ID NO: 2) which 
partially retains the inclusivity behavior of 
NTll-2. 

05 37. An oligonucleotide probe of Claim 36 which 

substantially retains the exclusivity behavior 
of NTll-2 (SEQ ID NO: 2) towards non-EIEC 
Enterobacteriaceae . 

An oligonucleotide probe of Claim 37 comprising 
a sequence selected from the group consisting , 
of: 

a) 1708 (SEQ ID NO: 19) , 
5 ' -CCACCGTTGAAGCGTAAACCGTTTGACCGATGG- 

AT-3'; and 

b) 1709 (SEQ ID NO:20) , 
5 ' -GCTGGGGTCTACAGGTGCAATAACCACTTAGAC- 

GGT'-3' ; 

c) complements of (a) and (b) ; and 

d) homologues of (a) through (c) . 

An oligonucleotide probe derived from Shigella 
specific fragment NT18-la (SEQ ID NO: 10) which 
substantially retains the inclusivity behavior 
of NT18-la. 

An oligonucleotide probe of Claim 39 which 
substantially retains the exclusivity behavior 
of NT18-la (SEQ ID NO: 10) towards non-EIEC 
Enterobacteriaceae . 



38 . 

10 



15 



20 39. 



40. 

25 
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41. An oligonucleotide probe of Claim 40 comprising 
a sequence selected from the group consisting 
of: 

a) 1712 (SEQ ID NO:23) , 

05 5 ' -CCTGTGGCTCTCGGTTCTGATGGTATAGCAACT- 

AAAT-3 ' ; 

b) 1713 (SEQ ID NO:24) , 

5 ' -CAAGGATGTTTCGGAATTGAGTGGGGAGTTGCG- 

AAAT-3 ' ; 

10 c) complements of (a) and (b) ; and 

d) homologues of (a) through (c) . 

42. An oligonucleotide probe derived from Shigella 
specific fragment NT19-2 (SEQ ID NO: 11) which 
substantially retains the inclusivity behavior 

15 of NT19-2. 

43. An oligonucleotide probe of Claim 42 which 
substantially retains the exclusivity behavior 
of NT19-2 (SEQ ID NO: 11) towards non-EIEC 
Enterobacter iaceae . 

20 44. An oligonucleotide probe of Claim 43 comprising 
a sequence selected from the group consisting 
of: 

a) 1684 (SEQ ID NO:25) , 

5 ' -CAGGCAATCGAAGCATATCGCGGTTCTCCACAACT-3 ' ; 

25 b) 1685 (SEQ ID NO:26) , 

5 ' -TGAATGCGCTGACCGAAAACCAGCGCTGGGTATCT-3 ' ; 

c) complements of (a) and (b) ; and 

d) homologues of (a) through (c) . 
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1 

biotinylate by nick-translation 
method 



Denature and Hybridize 
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37°C, Overnight 
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(pretreated w/IOO^g denatured 
Salmon Sperm DNA) 



FIG. 1 



Repeat 3-4 times with fresh competitor 
DNA; Stop cycling when hybridization 
signal of 32p target DNA to spotted 
competitor DNA is negligible 



Use enriched target DNA to probe library of 
Target DNA clones (e.g., in pUC18) 
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